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RAPIDLY DEGRADED REPORTER FUSION PROTEINS 



Field of the Invention 

This invention relates to the field of biochemical assays and reagents. 
5 More specifically, this invention relates to modified reporter proteins, e.g., 
fluorescent reporter proteins, and to methods for their use. 



Background of the Invention 

Luciferases are enzymes that catalyze the oxidation of a substrate 
. 10 (luciferin) with the concomitant release of photons of light. Luciferases have 
been isolated from numerous species, including Coleopteran arthropods and 
many sea creatures. Because it is easily detectable and its activity can be 
quantified with high precision, luciferase/luciferin enzyme/substrate pairs have 
been used widely to study gene expression and protein localization. Unlike 

1 5 another reporter protein, green fluorescent protein (GFP), which requires up to 
30 minutes to form a chromophore, the products of luciferases can be detected 
immediately upon completion of synthesis of the polypeptide chain (if substrate 
and oxygen are also present). As luciferase is a usefiil reporter in numerous 
species and in a wide variety of cells, luciferases are ideal for monitoring gene 

20 up-regulation. However, the stability of native luciferases and native GFP can 
fiinctionally mask rehable detection of gene down-regulation. 

Protein degradation is necessary to rid cells of damaged and npn- 
fimctioning proteins. Intracellular degradation of proteins is a highly selective 
process that allows some proteins to survive for hours or days, while other 

25 proteins survive for only minutes, inside the cell. In recent years, the processes 
controlling protein degradation have become an important area of study, fiirther 
prompted perhaps by reports that the failme of key components in protein 
degradation can be causative in human disease (Bence et al., 2001; McNaught et 
al., 2001). 

30 Protein degradation is not limited to the removal of damaged or 

otherwise abnormal proteins, as a number of regulatory circuits involve proteins 
with short half-hves (relatively "unstable" proteins). For example, proteolysis 
plays an unportant regulatory role in many cellular processes including 
metabolic control, cell cycle progression, signal transduction and transcription 
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(Hicke, 1997; Joazeiro et al., 1999; Murray et al., 1989; Salghetti et al., 2001). 
A great part of selective protein degradation in eukaryotes appears to be carried 
out in the proteosome, an ATP-dependent multi-protein complex. In many 
degradation pathways, the covalent conjugation of ubiquitin, a 76 amino acid 
5 polypeptide, to proteins destined for degradation, precedes degradation in the 
proteosome (Hershko et al., 1992). 

Ornithine decarboxylase (ODC) is an enzyme which is critical in the 
biosynthesis of polyamines and is known to have a very short cellular half-life. 
In fact, murine ornithine decarboxylase (mODC) is one of most short-lived 
10 proteins, with a half-life of about 30 minutes (see Ghoda et al, 1989; Ghoda et 
al. 1992). Rapid degradation of ODC has been attributed to the unique 
composition of its C-terminus which mcludes a "PEST" sequence (Rogers et al., 
1989; Reichsteiner, 1990). A PEST sequence contains a region enriched with 
proline (P), glutamic acid (E), serine (S), and threonine (T), that is often flanked 
15 by basic amino acids, lysine, arginine, or histidine. The PEST sequence targets 
the PEST containing protein towards the 26S proteosome without prior 
ubiquitinization (Gilon et al, 1998; Leclerc et al., 2000; Corish et al., 1999; Li et 
al., 1998; Li et al, 2000). Deletion of the C-terminal PEST containing region 
from mODC prevents its rapid degradation (Ghoda et al, 1989). In contrast to 
20 mODC, the ODC of Tiypanosoma brucei (TbODC) does not have a PEST 

sequence, and is long-Uved and quite stable when expressed in mammaUan cells 
(Ghoda et al. 1990). However, fusion of the C-terminus of mODC to TbODC 
results in an unstable protein. 

One group has developed destabilized reporter proteins by fusing the 
25 coding sequence of the reporter with a destabilization sequence which either 
destabilizes mRNA encoding the reporter or destabiUzes the reporter 
polypeptide. For instance, the C-terminal region of ODC (C-ODC) has been 
shown to reduce the half-Hfe of GFP from about 26 hours to about 9.8 hours 
(Corish, 1999) and, when a mutant form of C-ODC was used, to less than 2 
30 hours (Li, 1998; Li, 2000). Moreover, a PEST sequence has been shown to 
reduce the half-Ufe of firefly luciferase from about 3.68 hours to about 0.84 
hours (Leclerc et al, 2000). Fan et al. (1997) found that the presence of an AU- 
rich region from a herpes virus RNA conferred instabiUty to that RNA as well as 
to heterologous RNAs, thereby destabilizing the mRNA. 
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Peptide signals other than C-ODC that have been used for destabiUzation 
of proteins include the cyclin destruction box (Corish et al., 1999; King et al, 
1996), the PEST-rich C-terminal region of cyclin (Mateus et al, 2000), CL 
peptides, e.g., CLl (Gilon et al., 1998; Bence et al, 2001) and N-degron. 
5 Although all of these signals direct proteins containing them towards 

degradation by the proteosome, the pathways followed by these proteins before 
they reach the proteosome may be different. For instance, degradation of ODC 
occurs in the 26S proteosome in the absence of prior ubiquitination (Bercovich 
et al, 1989; Murakami et al, 1992), while CL peptides channel proteins for 
10 degradation via a pathway that depends upon the presence of ubiquitin- 
conjugating enzymes (Gilon et al, 1998). Degradation of cycUns is also a 
ubiquitination-dependent process (Glotzer et al, 1991). Nevertheless, cycUns 
become unstable only when cells exit mitosis (Hunt et al, 1992). 

More than fifteen years ago Bachmir et al (1986) described the N-degron 
15 degradation signal These authors reported that certain amino acid residues, if 
positioned at the N-terminus of the protein, can stimulate the use of internal 
protein lysine residues by ubiquitin Hgase as a point for attachment of ubiquitin. 
As a result such N-terminal residues (called destabilizing residues) caused 
dramatic destabilization of the corresponding protein. The relationship between 
20 the identity of the N-terminal residue and the in vivo half-Ufe of the 

corresponding protein has been referred by these authors as the N-end rule 
(Varshavsky, 1992). This mle has been extensively studied in yeast where 
species of p-galactosidase with N-terminal Arg, Lys, Phe, Leu, Trp, His, Asp, 
Asn, Tyr, Gki, He or Glu had half-lives of 2-30 minutes. At the same time, p- 
25 galactosidase species with any of the other eight amino acid residues had half- 
lives of more than 20 hours (Varshavsky, 1992). The most efficient 
destabilizing residue with a corresponding half-Ufe of 2 minutes was found to be 
arginine. It has been reported that the N-end rule also operates in E. coli, where 
an arginine residue was found to be the most effective destabihzing residue 
30 (Tobias et al, 1991). In E. coli, similarly to yeast, positioning of this residue at 
the N-terminus of p-galactosidase resulted in a protein that had a half-Ufe of 2 
minutes instead of more than 10 hours (Varshavsky, 1992; Tobias et al., 1991). 
Several groups have reported data supportmg the existence of the N-end rule in 
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rabbit, mouse and tobacco (Varshavsky, 1992; Reiss et al, 1988; Kwon et al., 
1998; Townsend et al., 1988). 

However, what is needed is an improved recombinant reporter protein, 
e.g., for use in higher.eukaryotes. 

5 

Summary of the Invention 

The invention provides improved gene products, e.g., reporter proteins, 
with reduced or decreased, e.g., substantially reduced or decreased, half-lives, of 
expression, which are useful to detemiine or detect gene expression, e.g., up- or 
10 down-regulation, to monitor promoter activity, to reduce cytotoxicity, and to 
localize proteins In one embodiment, the invention provides an isolated nucleic 
acid molecule (polynucleotide) comprising a nucleic acid sequence encoding a 
fusion polypeptide comprising a reporter protein, e.g., a luciferase, GFP, 
chloramphenicol acetyltransferase, beta-glucuronidase or beta-galactosidase, 
1 5 which nucleic acid molecule comprises at least two heterologous destabiUzation 
sequences, e.g., encoding at least two heterologous protein destabiUzation 
sequences, or encoding at least one heterologous protein destabiUzation 
sequence and comprising at least one heterologous mRNA destabiUzation 
sequence. As used herem, a ''heterologous" destabiUzation sequence is one 
20 which is not found in the wild-type gene for the reporter protein employed in the 
fusion polypeptide. The presence of one or more destabiUzation sequences in a 
nucleic acid molecule of the invention which is introduced to a host ceU or to an 
in vitro transcription/translation mixture, results in reporter activity (expression) 
that is reduced or decreased, e.g., a substantiaUy reduced or decreased half-life 
25 of reporter expression, relative to the reporter activity for a corresponding 
reporter protein gene that lacks one or more of the destabiUzation sequences. 
For example, the presence of one or more protein destabiUzation sequences in a 
fusion polypeptide encoded by a nucleic acid molecule of the invention results in 
a reduction or decrease in the half-Ufe of the fusion polypeptide relative to a 
30 corresponding protein which lacks the destabiUzation sequence(s). The presence 
of one or more RNA destabiUzation sequences in a nucleic acid molecule of the 
invention results in a reduction or decrease in the half-Ufe of the mRNA 
transcribed from that nucleic acid molecule relative to a nucleic acid molecule 
which lacks the destabiUzation sequence(s). Preferably, tiie nucleic acid 
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molecule of the invention comprises sequences which have been optimized for 
expression in mammalian cells, and more preferably, in human cells (see, e.g., 
WO 02/16944 which discloses methods to optimize sequences for expression in 
a cell of interest). For instance, nucleic acid molecules may be optimized for 
5 expression in eukaryotic cells by introducing a Kozak sequence and/or one or 
more introns, and/or by altering codon usage to codons employed more 
jfrequently in one or more eukaryotic organisms, e.g., codons employed more 
frequently ia an eukaryotic host cell to be transformed with the nucleic acid 
molecide. 

10 A protein destabiUzation sequence includes one or more amino acid 

residues, which, when present at the N-terminus or C-terminus of a protein of 
interest, reduces or decreases, e.g., having a reduction or decrease in the half-Ufe 
of the protein of interest of at least 80%, preferably at least 90%, more 
preferably at least 95% or more, e.g., 99%, relative to a corresponding protein 

1 5 which lacks the protein destabilization sequence. The presence of the protein 

destabiUzation sequence in a fusion polypeptide preferably does not substantially 
alter other functional properties of the protein of interest. In one embodiment, a 
protein destabilization sequence has less than about 200 amino acid residues. A 
protein destabiUzation sequence includes, but is not limited to, a PEST sequence, 

20 for example, a PEST sequence from cyclin, e.g., mitotic cyclins, macil permease 
or ODC, a sequence from the C-terminal region of a short-Uved protein such as 
ODC, early response proteins such as cytokines, lymphokines, protooncogenes, 
e.g., c-myc or c-fos, MyoD, HMG CoA reductase, S-adenosyl methionine 
decarboxylase, CL sequences, a cyclin destmction box, N-degron, or a protein or 

25 a fragment thereof which is ubiquitinated in vivo, 

A mRNA destabiUzation sequence includes two or more nucleotides, 
which, when present in a mRNA, reduces or decreases, e.g., substantially 
reduces or decreases, for instance, having a reduction or decrease in the half-Ufe 
of the mRNA encoding a protein of interest of at least 20%, including 50%, 70% 

30 or greater, e.g., 90% or 99%, relative to a mRNA that lacks the mRNA 
destabilization sequence and encodes the corresponding protein. In one 
embodiment, a mRNA destabilization sequence has less than about 100 
nucleotides. A mRNA destabilization sequence includes, but is not Umited to, a 
sequence present in the 3' UTR of a mRNA which likely forms a stem-loop, one 
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or more AUUUA or UUAUUUAUU sequences, including the 3' UTR of the 
bradykinin Bl receptor gene. 

In one embodiment, the nucleic acid molecule is present in a vector, e.g., 
a plasmid. In one embodiment, the nucleic acid molecule encodes a destabilized 
5 fusion polypeptide comprising a reporter protein, which nucleic acid molecule 
comprises SEQ ID NO:47, SEQ ID NO:48, SEQ ID NO:49, SEQ ID NO:66, 
SEQ ID NO:68, SEQ ID NO:69, SEQ ID NO:70, SEQ ID N0:71, SEQ ID 
NO:72, SEQ ID NO:73, SEQ ID NO;74, SEQ ID NO:75, SEQ ID NO:76, SEQ 
ID NO:77, SEQ ID NO:78, SEQ ID NO:79; SEQ ID NO:80, or a fragment 

10 thereof that encodes a fusion polypeptide with substantially the same activity as 
the corresponding full-length fusion polypeptide. As used herein, "substantially 
the same activity" is at least about 70%, e.g., 80%, 90% or more, the activity of a 
corresponding full-length fusion polypeptide. 

As described herein, the combination of two protein degradation (CLl 

1 5 and mODC) sequences in the same luciferase fusion polypeptide resulted in a 
reduction of the half-Ufe of firefly luciferase to about 30 minutes and Renilla 
luciferase to about 20 minutes. Also, N-degron and mODC complemented each 
other in that the combination of these two degradation signals in the same 
protein resulted in a substantial increase in the rate of degradation of the 

20 corresponding protein. Moreover, introduction of a mRNA destabilization 3* to 
the open reading frame for the luciferase fusion polypeptide decreased the half- 
life of luciferase expression by destabiUzing sequence transcription. Further, the 
combination of a mRNA and a protein destabihzation sequence was shown to be 
effective in at least 3 different cells (HeLa, CHO and 293 cells) in shortening the 

25 expression of two different luciferase proteins. In addition, the presence of 

mammaUan cell-optimized sequences for a fusion polypeptide of the invention, 
in cells transfected with a plasmid comprising those sequences, enhanced the 
amount of light emitted by those cells as a result of the more efficient translation 
of RNA encoding the fusion polypeptide. Thus, the presence of optimized 

30 sequences including codon optimized sequences in a nucleic acid molecule 

encoding a fusion polypeptide of the invention, e.g., optimized sequences for the 
reporter protein, optimized sequences for the protein destabilization signal(s), or 
both, can yield an enhanced signal. 
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In one embodiment, the nucleic acid molecule comprises a nucleic acid 
sequence encoding a fusion polypeptide comprising at least one and preferably at 
least two heterologous protein destabilization signals, which fusion polypeptide 
has a half-life that is substantially reduced or decreased, e.g., having at least a 
5 80%, preferably at least a 90%, more preferably at least a 95% or more, e.g., 
99%, reduction or decrease in half-Hfe, relative to the half-life of a 
corresponding v^ld-type protein, and/or emits more light as a result of the 
optimization of the nucleic acid sequences for expression in a desired cell 
relative to a fusion polypeptide encoded by sequences which are not optimized 

1 0 for expression in that cell. In one embodiment, the reporter protein is a 

luciferase, for instance, a Coleopteran or anthozoan luciferase such as a firefly 
luciferase or a Renilla luciferase, and the luciferase fusion polypeptide includes 
at least one heterologous protein destabilization sequence and has a substantially 
reduced half-life relative to a corresponding wild-type (native or recombinant) 

15 luciferase. Preferably, optimized nucleic acid sequences encoding at least the 
reporter protein are employed, as those optimized sequences can increase the 
strength of the signal for destabihzed reporter proteins. 

In another embodiment, the nucleic acid molecule comprises at least one 
heterologous mRNA destabilization sequence and encodes a fiision polypeptide 

20 comprising at least one heterologous protein destabilization sequence. 

Preferably, the mRNA destabilization sequence is 3' to the nucleic acid sequence 
encoding the fusion polypeptide. In one embodiment, the expression of the 
fusion polypeptide is reduced relative to a polypeptide encoded by a nucleic acid 
molecule which lacks the heterologous destabilization sequences. 

25 In one embodiment, the nucleic acid molecule comprises a nucleic acid 

sequence comprising an open readmg frame for a reporter protein and at least 
one heterologous destabilization sequence, wherein a majority of codons in the 
open reading frame for the reporter protein are optimized for expression in a 
^ particular host cell, e.g., a mammaUan cell such as a human cell. The presence 

30 of codon optimized sequences in the nucleic acid molecule can compensate for 
reduced expression from a corresponding nucleic acid molecule which is not 
codon optimized. 

The invention further includes a vector and host cell comprising a nucleic 
acid molecule of the invention and kits comprising the nucleic acid molecule. 
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vector or host cell. In particular the invention proAades a stable cell line that 
expresses a rapid turnover reporter protein with an enhanced signal relative to a 
corresponding stable cell line that expresses a corresponding nondestabilized 
reporter protein. 

5 A rapid turnover (destabilized) reporter protein such as luciferase can be 

used in applications where currently available reporter proteins with half-Uves of 
expression at least several hours cannot, such as, as a genetic reporter for 
analyzing transcriptional regulation and/or cz5-acting regulatory elements, as a 
tool for identifying and analyzing degradation domains of short-lived proteins or 

10 to accelerate screening of efficacious compounds. Cells containing a regulatable 
vector of the invention respond more quickly to induction or repression and 
show enhanced activation relative to cells containing a vector expressing a 
corresponding unmodified, e.g., wild-type, reporter protein. Moreover, the 
presence of a vector of the invention in host cells used for screening is 

15 advantageous in that those cells are less sensitive to impaired cell growth or to 
modification or loss of the vector, and allows for more precise quantification of 
signal. 

Hence, the present invention also provides an expression cassette 
comprising the nucleic acid sequence of the invention and a vector capable of 

20 expressing the nucleic acid sequence in a host cell. Preferably, the expression 
cassette comprises a promoter, e.g., a constitutive or regulatable promoter, 
operably linked to the nucleic acid sequence. Li one embodiment of the vector, 
the expression cassette contains an inducible promoter. Also provided is a host 
cell, e.g., an eukaryotic cell such as a plant or vertebrate cell, e.g., a mammaUan 

25 cell, including but not limited to a human, non-human primate, canine, feline, 
bovine, equine, ovine or rodent (e.g., rabbit, rat, ferret or mouse) cell, and a kit 
which comprises the nucleic acid molecule, expression cassette or vector of the 
invention. 

hi another aspect of the mvention, there is provided a method of labeling 
30 cells with a fiision polypeptide of the invention, hi this method, a cell is 

contacted with a vector comprising a promoter, e.g., a regulatable promoter, and 
a nucleic acid sequence encoding a fusion polypeptide comprising a protein of 
interest such as a reporter protein with a substantially decreased half-life of 
expression relative to a corresponding wild-type protein, hi one embodiment, a 
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transfected cell is cultured under conditions in which the promoter induces 
transient expression of the fusion polypeptide, which provides a transient 
reporter label. 

5 Brief Description of the Figures 

Figure lA. Lysates of CHO cells containing plasmid pwtLucl (lane 2), 
pUbiq(Y)Lucl9 (lane 3) or pLuc-PESTlO (lane 4), or a CHO lysate without 
plasmid (lane 5), were separated on 4-20% SDS-PAGE, transferred on to an 
ImmobilonP membrane and luciferase species were detected with rabbit anti- 
1 0 firefly luciferase and anti-rabbit antibodies conjugated with alkalme 

phosphatase. Lane 1 corresponds to See Blue Pre-Stained Standard firom 
Invitrogen. 

Figure IB. Proteins translated with wheat germ extracts containing 
mRNA obtained using plasmid pwtLucl (lane 1) or pETUbiqLuc (lane 2), or 
1 5 without external mRNA (lane 3), were separated on 4-20% SDS-PAGE and the 
proteins visuahzed by autoradiography. 

Figure IC. TNT® T7 Coupled Reticulocyte Lysates containing plasmid 
pETwtLucl (lane 1), pT7Ubiq(Y)Lucl9.2 (lane 2), pT7 Ubiq(E)Lucl9.1 (lane 3) 
or pT7Luc-PEST10 (lane 4), were separated on 4-20% SDS-PAGE and the 
20 proteins visuahzed by autoradiography. 

Figure 2. Plasmids encoding wild-type firefly luciferase and fiision 
proteins comprising firefly luciferase were expressed in TNT® T7 Coupled 
Reticulocyte Lysate System. Specific activity was determined as the ratio 
between total luciferase activity accumulated in each mixture and the amount of 
25 [^H]-Leucine incorporated in each protein. 

Figure 3. Cells transiently transfected with plasmids encoding wild-type 
firefly luciferase (pwtLucl), a ubiquitin-luciferase fiision protein 
(pUbiq(Y)Lucl9 and pT7Ubiq(Y)Lucl9.2), or a fiision protein comprismg 
firefly luciferase and a mutant form of C-ODC (mODC) (pLuc-PESTl 0) were 
30 treated with cycloheximide (100 jxg/ml) for different periods of time. Upon 
completion of incubation, and to define stability, cells were lysed, and 
accumulated luciferase activity was determined using a MLX Microtiter Plate 
Luminometer. 
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Figure 4. CHO (A), COS-7 (B), and HeLa (C) ceUs, transfected with 
ubiquitin-luciferase fusion protein encoding plasmids, were treated with 
cyclohexinaide for different periods of time. Cellular luminescence was 
measured to determine the stability of the corresponding proteins. Control cells 
5 that had not been treated with cycloheximide were used to determine background 
luciferase activity. 

Figure 5. The partial amino acid sequence of ubiquitin-luciferase fusion 
proteins was evaluated m establishing the relative importance of the N-terminal 
residue in detemiining protein half-Ufe. Shadowed/boxed areas mark ubiquitin 
1 0 and luciferase sequences. Thick lines mark the position of deletions. 

Figure 6. CHO (A) and COS-7 (B) cells were transiently transfected 
with plasmids encoding either wild-type firefly luciferase (pwtLucl) or 
ubiquitin-luciferase fusion proteins with different N-terminal luciferase amino 
acid residues. Twenty-four hours after transfection, the cells were treated with 
15 cycloheximide (100 jig/ml) for different periods of time and, upon completion of 
incubation, luminescence of accumulated luciferase was measured. 

Figure 7. HeLa cells were transfected with plasmids encoding wild-type 
luciferase (pwtLucl), a fusion protein comprising luciferase and mODC (pLuc- 
PESTIO), or a fusion protein comprising ubiquitin, firefly luciferase, and mODC 
20 (pUbiq(Y)Luc-PEST5, pUbiq(R)Luc-PEST12, pT7Ubiq(E)Luc-PEST23 and 

pT7lJbiq(E)hLuc+PEST80). Twenty-four hours after transfection, the cells were 
treated with cycloheximide (100 fig/ml) for different periods of time. Cellular 
luminescence was measured to determine the stabiUty of the corresponding 
luciferase (A). Control cells that had not been treated with cycloheximide were 
25 used to compare the luciferase activity of different constructs (B). 

Figure 8. CHO cells were transiently transfected with various plasmids. 
Twenty-four hours post-transfection, the cells were treated with cycloheximide 
(100 |ig/ml) for different periods of time. After incubation, luminescence due to 
accumulated luciferase was measured. Control cells that had not been treated 
30 with cycloheximide were used to determine background luciferase activity. 

Figure 9. Comparison of luciferase fusion protein properties in a tet 
inducible system after doxycycline (2 /^g/ml) (A) or cycloheximide (100 /ig/ml) 
(B) treatment. Luminescence data from control cells that had not been treated 
with either doxycycline or cycloheximide are depicted in panel C. 
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Figures 10 A-B. Comparison of luciferase fusion protein properties 
Renilla luciferase (A) and jBrefly luciferase (B) in a heat shock inducible system. 
Figure 1 1 . Schematic of selected vectors. 

Figures 12 A-B. Induction of luminescence in D293 cells transiently 
5 transfected with Renilla luciferase vectors with multiple CREs, forskolin (1 0 
liM) and isoproterenol (0.25 /iM). 

Figure 13 A-B. Luminescence profiles of hCG-D293 cells transiently 
transfected with vectors encoding stable and destabilized versions of firefly 
luciferase. Cells were treated with isoproterenol (1 jiM) and Ro-20-1724 (100 
10 fxM) or isoproterenol (1 iM) and Ro-20-1724 (100 /iM) followed by treatment 
with human chorionic gonadotropin (hCG) (10 ng/ml) and Ro-20-1724 (100 
liM). Arrows indicate time points when chemicals were added to the cell 
cultures. 

Figure 14. Luminescence versus fold induction in D293 cells stably 
15 transfected with destabihzed vectors. Cells were treated with forskoUn (10 /xM) 
for 7 hours or incubated in forskolin-free media. All vectors were under the 
control of a cAMP regulated promoter. 

Figure 15. Fold induction by isoproterenol and prostaglandin El (PGEl) 
in 293 cells transfected with codon optimized firefly or Renilla luciferase in 
20 conjunction with destabilization sequences in a CRE system. (A)-(B): PGEl 
added 24 hours after Iso/Ro; (C)-(D): PGEl added 6 hours after Iso/Ro. 

Figure 16. Fold induction by isoproterenol in 293 cells transfected with 
either red (CBR) (B) or green (CBG) (A) click beetle sequences in conjunction 
with destabilization sequences in a CRE system 

25 

Detailed Description of the Invention 

Definitions 

The term "nucleic acid molecule", "gene" or "nucleic acid sequence" as 
used herein, refers to nucleic acid, DNA or RNA, that comprises coding 
3 0 sequences necessary for the production of a polypeptide or protein precursor. 
The polypeptide can be encoded by a full-length coding sequence or by any 
portion of the coding sequence, as long as the desired protein activity is retained. 

A "nucleic acid", as used herein, is a covalently linked sequence of 
nucleotides in which the 3' position of the pentose of one nucleotide is joined by 
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a phosphodiester group to the 5' position of the pentose of the next, and in which 
the nucleotide residues (bases) are linked in specific sequence, i.e., a linear order 
of nucleotides. A "polynucleotide", as used herein, is a nucleic acid containing a 
sequence that is greater than about 100 nucleotides in length. An 
5 "oUgonucleotide" or "primer", as used herein, is a short polynucleotide or a 

portion of a polynucleotide. An oligonucleotide typically contains a sequence of 
about two to about one hundred bases. The word "oUgo" is sometimes used in 
place of the word "oligonucleotide". 

Nucleic acid molecules are said to have a "5'-terminus" (5' end) and a 

10 "3'-terminus" (3' end) because nucleic acid phosphodiester linkages occur to the 
5' carbon and 3' carbon of the pentose ring of the substituent mononucleotides. 
The end of a polynucleotide at which a new Unkage would be to a 5' carbon is its 
5' terminal nucleotide. The end of a polynucleotide at which a new linkage 
would be to a 3' carbon is its 3' terminal nucleotide. A terminal nucleotide, as 

15 used herein, is the nucleotide at the end position of the 3'- or 5'-terminus. 

DNA molecules are said to have "5' ends" and "3' ends" because 
mononucleotides are reacted to make oligonucleotides in a manner such that the 
5' phosphate of one mononucleotide pentose ring is attached to the 3' oxygen of 
its neighbor in one direction via a phosphodiester linkage. Therefore, an end of 

20 an oUgonucleotides referred to as the "5' end" if its 5' phosphate is not linked to 
the 3' oxygen of a mononucleotide pentose ring and as the "3' end" if its 3' 
oxygen is not linked to a 5' phosphate of a subsequent mononucleotide pentose 
ring. 

As used herein, a nucleic acid sequence, even if internal to a larger 
25 oUgonucleotide or polynucleotide, also may be said to have 5' and 3' ends. In 
either a linear or circular DNA molecule, discrete elements are referred to as 
being "upstream" or 5' of the "downstream" or 3' elements. This terminology 
reflects the fact that transcription proceeds in a 5' to 3' fashion along the DNA 
strand. Typically, promoter and enhancer elements that direct transcription of a 
30 linked gene (e.g., open reading iframe or coding region) are generally located 5' 
or upstream of the coding region. However, enhancer elements can exert their 
effect even when located 3' of the promoter element and the coding region. 

12 
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Transcription termination and polyadenylation signals are located 3' or 
downstream of the coding region. 

The term "codon" as used herein, is a basic genetic coding unit, 
consisting of a sequence of three nucleotides that specify a particular amino acid 
5 to be incorporation into a polypeptide chain, or a start or stop signal. The term 
"coding region" when used in reference to structural genes refers to the 
nucleotide sequences that encode the amino acids found in the nascent 
polypeptide as a result of translation of a mRNA molecule. Typically, the 
' coding region is bounded on tiie 5' side by the nucleotide triplet "ATG" which 
10 encodes the initiator methionine and on the 3' side by a stop codon (e.g., TAA, 
TAG, TGA). In some cases the coding region is also known to initiate by a 
nucleotide triplet "TTG". 

By "protein" and "polypeptide" is meant any chain of amino acids, 
regardless of length or post-translational modification (e.g., glycosylation or 
1 5 phosphorylation). The nucleic acid molecules of the invention may also encode 
a variant of a naturally-occurring protein or polypeptide fragment thereof. 
Preferably, such a protein polypeptide has an amino acid sequence that is at least 
85%, preferably 90%, and most preferably 95% or 99% identical to the amino 
acid sequence of the naturally-occurring (native or wild-type) protein from 
20 which it is derived. 

Polypeptide molecules are said to have an "amino terminus" 
(N-terminus) and a "carboxy terminus" (C-terrainus) because peptide linkages 
occur between the backbone amino group of a first amino acid residue and the 
backbone carboxyl group of a second amino acid residue. The terms 
25 "N-terminal" and "C-terminal" in reference to polypeptide sequences refer to 
regions of polypeptides including portions of the N-terminal and C-terminal 
regions of the polypeptide, respectively. A sequence that includes a portion of 
the N-terminal region of a polypeptide includes amino acids predominantly from 
the N-terminal half of the polypeptide chain, but is not limited to such 
30 sequences. For example, an N-terminal sequence may include an interior portion 
of the polypeptide sequence including bases from both the N-terminal and 
C-terminal halves of the polypeptide. The same appUes to C-terminal regions. 
N-terminal and C-terminal regions may, but need not, include the amino acid 
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defining the ultimate N-terminus and C-terminus of the polypeptide, 
respectively. 

The term "wild-type" as used herein, refers to a gene or gene product that 
has the characteristics of that gene or gene product isolated from a naturally 
5 occurring source. A wild-type gene is that which is most frequently observed in 
a population and is thus arbitrarily designated the "wild-type" form of the gene. 
In contrast, the term "mutant" refers to a gene or gene product that displays 
modifications in sequence and/or functional properties (i.e., altered 
characteristics) when compared to the wild-type gene or gene product. It is 
1 0 noted that naturally-occurring mutants can be isolated; these are identified by the 
fact that they have altered characteristics when compared to the wild-type gene 
or gene product. 

The term "recombinant protein" or "recombinant polypeptide" as used 
herein refers to a protein molecule expressed from a recombinant DNA 

15 molecule. In contrast, the term "native protein" is used herein to indicate a 
protein isolated from a naturally occurring (i.e., a nonrecombinant) source. 
Molecular biological techniques may be used to produce a recombinant form of 
a protein with identical properties as compared to the native form of the protein. 
The term "fiision polypeptide" refers to a chimeric protein containing a 

20 protein of laterest (e.g., luciferase) joined to a heterologous sequence (e.g., a 
non-luciferase amino acid or protein). 

The teims "cell," "cell line," "host cell," as used herein, are used 
interchangeably, and all such designations include progeny or potential progeny 
of these designations. By "transformed cell" is meant a cell into which (or into 

25 an ancestor of which) has been introduced a nucleic acid molecule of the 

invention, e.g., via transient transfection. Optionally, a nucleic acid molecule 
synthetic gene of the invention may be introduced into a suitable cell line so as 
to create a stably-transfected cell fine capable of producing the protein or 
polypeptide encoded by the synthetic gene. Vectors, cells, and methods for 

30 constructing such cell lines are well known in the art. The words 

"transfomiants" or "transformed cells" include the primary transformed cells 
derived from the originally transformed cell without regard to the number of 
transfers. All progeny may not be precisely identical in DNA content, due to 
deliberate or inadvertent mutations. Nonetheless, mutant progeny that have the 
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same functionality as screened for in the originally transformed cell are included 

in the definition of transfoimants. 

Nucleic acids are known to contain different types of mutations. A 

"point" mutation refers to an alteration in the sequence of a nucleotide at a single 
5 base position fi-om the wild type sequence. Mutations may also refer to insertion 

or deletion of one or more bases, so that the nucleic acid sequence differs firom 

the wild-type sequence. 

The term "homology" refers to a degree of complementarity. There may 

be partial homology or complete homology (i.e., identity). Homology is often 
10 measured using sequence analysis software (e.g.. Sequence Analysis Software 

Package of the Genetics Computer Group, University of Wisconsin 

Bioteclmology Center. 1710 University Avenue. Madison, WI 53705). Such 

software matches similar sequences by assigning degrees of homology to various 

substitutions, deletions, insertions, and other modifications. Conservative 
15 substitutions typically include substitutions within the following groups: 

glycine, alanine; valine, isoleucine, leucine; aspartic acid, glutamic acid, 

asparagine, glutamine; serine, threonine; lysine, arginine; and phenylalanine, 

tyrosine. 

The term "isolated" when used in relation to a nucleic acid, as in 
20 "isolated oUgonucleotide" or "isolated polynucleotide" refers to a nucleic acid 
sequence that is identified and separated from at least one contaminant with 
which it is ordinarily associated in its soiirce. Thus, an isolated nucleic acid is 
present in a form or setting that is different firom that in which it is found in 
nature. In contrast, non-isolated nucleic acids (e.g., DNA and RNA) are found 
25 in the state they exist in nature. For example, a given DNA sequence (e.g., a 
gene) is found on the host cell chromosome in proximity to neighboring genes; 
RNA sequences (e.g., a specific mRNA sequence encoding a specific protein), 
are found in the cell as a mixture with numerous other mRNAs that encode a 
multitude of proteins. However, isolated nucleic acid includes, by way of 
30 example, such nucleic acid in cells ordinarily expressing that nucleic acid where 
the nucleic acid is in a chromosomal location different from that of natural cells, 
or is otherwise flanked by a different nucleic acid sequence than that found in 
nature. The isolated nucleic acid or oligonucleotide may be present in 
single-stranded or double-stranded form. When an isolated nucleic acid or 
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oligonucleotide is to be utilized to express a protein, the oligonucleotide contains 
at a minimum, the sense or coding strand (i.e., the ohgonucleotide may be 
single-stranded), but may contain both the sense and anti-sense strands (i.e., the 
oligonucleotide may be double-stranded). 
5 The term "isolated" when used in relation to a polypeptide, as in "isolated 

protein" or "isolated polypeptide" refers to a polypeptide that is identified and 
separated from at least one contaminant with which it is ordinarily associated in 
its source. Thus, an isolated polypeptide is present in a form or setting that is 
different from that in which it is found in nature. In contrast, non-isolated 
1 0 polypeptides (e.g., proteins and enzymes) are found in the state they exist in 
nature. 

The term "purified" or "to purify" means the result of any process that 
removes some of a contaminant from the component of interest, such as a 
protein or nucleic acid. The percent of a purified component is thereby 

1 5 increased in the sample. 

The term "operably linked" as used herein refer to the linkage of nucleic 
acid sequences in such a manner that a nucleic acid molecule capable of 
directing the transcription of a given gene and/or the synthesis of a desired 
protein molecule is produced. The term also refers to the linkage of sequences 

20 encoding amino acids in such a manner that a fiinctional (e.g., enzymatically 
active, capable of binding to a binding partner, capable of inhibiting, etc.) 
protein or polypeptide is produced. 

The term "recombinant DNA molecule" means a hybrid DNA sequence 
comprising at least two nucleotide sequences not normally found together in 

25 nature. The term "vector" is used in reference to nucleic acid molecules 

into which fragments of DNA may be inserted or cloned and can be used to 
transfer DNA segment(s) into a cell and capable of replication in a cell. Vectors 
may be derived from plasmids, bacteriophages, viruses, cosmids, and the like. 
The terms "recombinant vector" and "expression vector" as used herein 

30 refer to DNA or RNA sequences containing a desired coding sequence and 
appropriate DNA or RNA sequences necessary for the expression of the 
operably linked coding sequence in a particular host organism. Prokaryotic 
expression vectors include a promoter, a ribosome binding site, an origin of 
replication for autonomous replication in a host cell and possibly other 
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sequences, e.g. an optional operator sequence, optional restriction enzyme sites. 
A promoter is defined as a DNA sequence that directs RNA polymerase to bind 
to DNA and to initiate RNA synthesis. Eukaryotic expression vectors include a 
promoter, optionally a polyadenlyation signal and optionally an enhancer 
5 sequence. 

A polynucleotide having a nucleotide sequence encoding a protein or 
polypq)tide means a nucleic acid sequence comprising the coding region of a 
gene, or in other words the nucleic acid sequence encodes a gene product. The 
coding region may be present in either a cDNA, genomic DNA or RNA form. 
10 When present in a DNA form, the ohgonucleotide may be single-stranded (i.e., 
the sense strand) or double-stranded. Suitable control elements such as 
enhancers/promoters, splice junctions, polyadenylation signals, etc. maybe 
placed in close proximity to the coding region of the gene if needed to permit 
proper initiation of transcription and/or correct processing of the primary RNA 
15 transcript. Alternatively, the coding region utiUzed in the expression vectors of 
the present invention may contain endogenous enhancers/promoters, splice 
junctions, intervening sequences, polyadenylation signals, etc. In fiirther 
embodiments, the coding region may contain a combination of both endogenous 
and exogenous control elements. 
20 The term "transcription regulatory element" or "transcription regulatory 

sequence" refers to a genetic element or sequence that controls some aspect of 
the expression of nucleic acid sequence(s). For example, a promoter is a 
regulatory element that facilitates the initiation of transcription of an operably 
linked coding region. Other regulatory elements include, but are not Umited to, 
25 transcription factor buiding sites, sphcing signals, polyadenylation signals, 
termination signals and enhancer elements. 

Transcriptional control signals in eukaryotes comprise "promoter" and 
"enhancer" elements. Promoters and enhancers consist of short arrays of DNA 
sequences that interact specifically with cellular proteins involved in 
30 transcription. Promoter and enhancer elements have been isolated from a variety 
of eukaryotic sources including genes in yeast, insect and mammaUan cells. 
Promoter and enhancer elements have also been isolated from viruses and 
analogous control elements, such as promoters, are also found in prokaryotes. 
The selection of a particular promoter and enhancer depends on the cell type 
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used to express the protein of interest. Some eukaryotic promoters and 
enhancers have a broad host range while others are functional in a Umited subset 
of cell types. For example, the SV40 early gene enhancer is very active in a 
wide variety of cell types jfrom many mammalian species and has been widely 

5 used for the expression of proteins in mammaUan cells, Two other examples of 
promoter/enhancer elements active in a broad range of mammaUan cell types are 
those from the human elongation factor 1 gene (Uetsuki et al., 1989; Kim et al., 
1990; and Mizushima and Nagata, 1990) and the long terminal repeats of the 
Rous sarcoma virus (Gorman et al, 1982); and the human cytomegalovirus 

10 (Boshart et al., 1985). 

The term "promoter/enhancer" denotes a segment of DNA containing 
sequences capable of providing both promoter and enhancer functions (i.e., the 
functions provided by a promoter element and an enhancer element as described 
above). For example, the long terminal repeats of retroviruses contain both 

1 5 promoter and enhancer functions. The enhancer/promoter may be "endogenous" 
or "exogenous" or "heterologous." An "endogenous" enhancer/promoter is one 
that is naturally linked with a given gene in the genome. An "exogenous" or 
"heterologous" enhancer/promoter is one that is placed in juxtaposition to a gene 
by means of genetic manipulation (i.e., molecular biological techniques) such 

20 that transcription of the gene is directed by the linked enhancer/promoter. 

The presence of "splicing signals" on an expression vector often results 
in higher levels of expression of the recombinant transcript in eukaryotic host 
cells. Splicing signals mediate the removal of introns from the primary RNA 
transcript and consist of a splice donor and acceptor site (Sambrook et al., 1989). 

25 A commonly used splice donor and acceptor site is the splice junction from the 
16SRNAofSV40. 

Efficient expression of recombmant DNA sequences in eukaryotic cells 
requires expression of signals directing the efficient termination and 
polyadenylation of the resulting transcript. Transcription termination signals are 

30 generally foimd downstream of the polyadenylation signal and are a few hundred 
nucleotides in length. The term "poly(A) site" or "poly(A) sequence" as used 
herein denotes a DNA sequence which directs both the termination and 
polyadenylation of the nascent RNA transcript. Efficient polyadenylation of the 
recombinant transcript is desirable, as transcripts lacking a poly(A) tail are 
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unstable and are rapidly degraded. The poly(A) signal utilized in an expression 
vector may be "heterologous" or "endogenous." An endogenous poly(A) signal 
is one that is found naturally at the 3' end of the coding region of a given gene in 
the genome. A heterologous poly(A) signal is one which has been isolated from 

5 one gene and positioned 3' to another gene. A commonly used heterologous 
poly(A) signal is the SV40 poly(A) signal. The SV40 poly(A) signal is 
contained on a 237 bp 5amH VBcl I restriction fragment and directs both 
termination and polyadenylation (Sambrook et al., 1989). 

Eukaryotic expression vectors may also contain "viral repUcons "or "viral 

1 0 origins of rephcation." Viral replicons are viral DNA sequences which allow for 
the extrachromosomal rephcation of a vector in a host cell expressing the 
appropriate rephcation factors. Vectors containing either the SV40 or polyoma 
virus origin of rephcation rephcate to high copy number (up to 10"* copies/cell) 
in cells that express the appropriate viral T antigen. In contrast, vectors 

1 5 containing the repHcons from bovine papillomavirus or Epstein-Barr virus 
replicate extrachromosomally at low copy number (about 100 copies/cell). 

The term "m vitro'' refers to an artificial enviroimient and to processes or 
reactions that occur within an artificial environment. In vitro environments 
include, but are not limited to, test tubes and cell lysates. The term "z« situ" 

20 refers to cell culture. The term "m vivo'' refers to the natural environment (e.g., 
an animal or a cell) and to processes or reactions that occur within a natural 
environment. 

The term "expression system" refers to any assay or system for 
determining (e.g., detecting) the expression of a gene of interest. Those skilled 

25 in the field of molecular biology will understand that any of a wide variety of 
expression systems may be used. A wide range of suitable mammahan cells are 
available from a wide range of sources (e.g., the American Type Culture 
Collection, Rockland, MD). The method of transfomiation or transfection and 
the choice of expression vehicle will depend on the host system selected. 

30 Transformation and transfection methods are described, e.g., in Ausubel et al., 
1992. Expression systems include in vitro gene expression assays where a gene 
of interest (e.g., a reporter gene) is linked to a regulatory sequence and the 
expression of the gene is monitored following treatment with an agent that 
inhibits or induces expression of the gene. Detection of gene expression can be 
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through any suitable means including, but not limited to, detection of expressed 
mRNA or protein (e.g., a detectable product of a reporter gene) or through a 
detectable change in the phenotype of a cell expressing the gene of interest. 
Expression systems may also comprise assays where a cleavage event or other 
5 nucleic acid or cellular change is detected. 

All amino acid residues identified herein are in the natural 
L-configuration. hi keeping with standard polypeptide nomenclature, 
abbreviations for amino acid residues are as shown in the following Table of 
Correspondence. 

10 

TABLE OF CORRESPONDENCE 



1 -Letter 


3-Letter 


AMINO ACID 


Y 


Tyr 


L-tyrosine 


G 


Gly 


L-glycine 


F 


Phe 


L-phenylalanine 


M 


Met 


L-methionine 


A 


Ala 


L-alanine 


S 


Ser 


L-serine 


I 


De 


L-isoleucine 


L 


Leu 


L-leucine 


T 


Thr 


L-threonine 


V 


Val 


L-valine 


P 


Pro 


L-proline 


K 


Lys 


L-lysine 


H 


ffis 


L-histidine 


Q 


Gin 


L-glutamine 


E 


Glu 


L-glutamic acid 


W 


Tip 


L-tryptophan 


R 


Arg 


L-argmine 


D 


Asp 


L-aspartic acid 


N 


Asn 


L-asparagine 


C 


Cys 


L-cysteine 
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The invention provides compositions comprising nucleic acid molecules 
comprising nucleic acid sequences encoding fusion polypeptides, as well as 
methods for using those molecules to yield fusion polypeptides, comprising a 
protein of interest with a reduced, e.g., a substantially reduced, half-life of 
5 expression relative to a conresponding parental (e.g., wild-type) polypeptide. 
The invention also provides a fusion polypeptide encoded by such a nucleic acid 
molecule. The invention may be employed to reduce the half-hfe of expression 
of any protein of interest, e.g., the half-life of a reporter protein, hi particular, the 
invention provides an isolated nucleic acid molecule comprising a nucleic acid 

1 0 sequence encodmg a fusion polypeptide comprising a protein of interest and a 
combination of heterologous destabilization sequences, e.g., one or more 
heterologous protem destabilization sequences and/or one or more heterologous 
mRNA destabihzation sequences, which results in a substantial reduction in the 
half-hfe of expression of the encoded fusion polypeptide. Heterologous protem 

15 destabihzation sequences may be at the N-terminus or the C-terminus, or at the 
N-terminus and the C-terminus of the protein of interest. A heterologous protein 
destabilization sequence may include 2 or more, e.g., 3 to 200, or any integer m 
between 3 and 200, amino acid residues, although not all of the residues in 
longer sequences, e.g., those greater than 5 residues in length, may be capable of 

20 destabilizing a Imked amino acid sequence. Multiple copies of any one protein 
destabihzation sequence may also be employed with a protein of interest. In one 
embodiment, different protem destabihzation sequences are employed, e.g., a 
combination of a CL sequence and a PEST sequence. Heterologous mRNA 
destabilization sequences are preferably 3' to the coding region for a fusion 

25 polypeptide of the invention. A heterologous mRNA destabihzation sequence 
may mclude 5 or more, e.g., 6 to 100, or any mteger m between 6 and 100, 
nucleotides, although not all of the residues in longer sequences, e.g., those 
greater than 10 nucleotides, may be capable of destabihzing a linked nucleotide 
sequence. Multiple copies of any one mRNA destabilization sequence may be 

30 employed. In one embodiment, different mRNA destabihzation sequences are 
employed. 

Optionally, a second polypeptide may be fused to the N-terminus of a 
fusion polypeptide comprising a protein of interest and a heterologous protein 
destabilization sequence, e.g., a destabihzation sequence which is present at the 
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N-terminus of the protein of interest. In one embodiment, the second 
polypeptide is a polypeptide which is cleaved after the C-tenninal residue by an 
enzyme present in a cell or cell extract, yielding a fusion polypeptide comprising 
a protein of interest with a heterologous protein destabilization sequence, e.g., at 
5 its N-terminus. In one embodiment, the second polypeptide is ubiquitin. 

In one embodiment, the N-terminal heterologous protein destabiUzation 
sequence is a cyclin destruction box or N-degron. In one embodiment, the C- 
terminal heterologous protein destabilization sequence is a CL peptide, CLl, 
CL2, CL6, CL9, CLIO, CLl 1, CL12, CL15, CL216, or CL17, SL17 (see Table 1 

1 0 of Gilon et al., 1 998, which is specifically incorporated by reference herein), a 
C-ODC or a mutant C-ODC, e.g., a sequence such as 
HGFXXXMXXQXXGTLPMSCAQESGXXRHPAACASARINV 
(corresponding to residues 423-461 of mODC), wherein one or more of the 
residues at positions marked with **X" are not the naturally occurring residue and 

1 5 wherein the substitution results in a decrease in the stability of a protein having 
that substituted sequence relative to a protein having the nonsubstituted 
sequence. For instance, a jfiision polypeptide comprising a mutant C-ODC 
which has a non-conservative substitution at residues corresponding to residues 
426, 427, 428, 430, 431, 433, 434, or 448 of ODC, e.g., from proline, aspartic 

20 acid or glutamic acid to alanine, can resuU in a fusion polypeptide with 

decreased stability, e.g., relative to a fusion polypeptide with a non-substituted 
C-ODC. 

The invention may be employed with any nucleic acid sequence, e.g., a 
native sequence such as a cDNA or one which has been manipulated in vitro, 

25 e.g., but is particularly useful for reporter genes as well as other genes associated 
with the expression of reporter genes, such as selectable markers. Preferred 
genes include, but are not limited to, those encoding lactamase (P-gal), 
neomycin resistance (Neo), CAT, GUS, galactopyranoside, GFP, xylosidase, 
thymidine kinase, arabinosidase and the like. As used herein, a "marker gene" 

30 or "reporter gene'* is a gene that imparts a distinct phenotype to cells expressing 
the gene and thus permits cells having the gene to be distinguished from cells 
that do not have the gene. Such genes may encode either a selectable or 
screenable marker, depending on whether the marker confers a trait which one 
can 'select' for by chemical means, i.e., through the use of a selective agent (e.g.. 
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a herbicide, antibiotic, or the like), or whether it is simply a "reporter" trait that 
one can identify through observation or testing, i.e., by 'screening' . Elements of 
the present disclosure are exemplified in detail through the \ise of particular 
marker genes. Of coxurse, many examples of suitable marker genes or reporter 
5 genes are known to the art and can be employed in the practice of the invention. 
Therefore, it will be understood that the following discussion is exemplary rather 
than exhaustive. In hght of the techniques disclosed herein and the general 
recombinant techniques which are known in the art, the present invention 
renders possible the alteration of any gene. 

10 Exemplary genes include, but are not limited to, a neo gene, a P-gal gene, 

a gus gene, a cat gene, a gpt gene, a hyg gene, a hisD gene, a ble gene, a mprt 
gene, a bar gene, a nitrilase gene, a mutant acetolactate synthase gene (ALS) or 
acetoacid synthase gene (AAS), a methotrexate-resistant dhfr gene, a dalapon 
dehalogenase gene, a mutated anthranilate synthase gene that confers resistance 

15 to 5-methyl tryptophan (WO 97/26366), an R-locus gene, a p-lactamase gene, a 
xylE gene, an a-amylase gene, a tyrosinase gene, a luciferase Que) gene, (e.g., a 
Renilla reniformis luciferase gene, a firefly luciferase gene, or a click beetle 
luciferase {Pyrophorus plagiophthalamus) gene, an aequorin gene, or a green 
fluorescent protein gene. Included within the terms selectable or screenable 

20 marker genes are also genes which encode a "secretable marker" whose 

secretion can be detected as a means of identifying or selecting for transformed 
cells. Examples include markers which encode a secretable antigen that can be 
identified by antibody interaction, or even secretable enzymes which can be 
detected by their catalytic activity. Secretable proteins fall into a number of 

25 classes, including small, diffusible proteins detectable, e.g., by ELISA, and 
proteins that are inserted or trapped in the cell membrane. 

In one embodiment, the nucleic acid sequence encoding the fusion 
polypeptide is optimized for expression in a particular cell. For example, the 
nucleic acid sequence is optimized by replacing codons in the wild-type 

30 sequence with codons which are preferentially employed in a particular 

(selected) cell. Preferred codons have a relatively high codon usage frequency in 
a selected cell, and preferably their introduction results in the introduction of 
relatively few transcription factor bmding sites, and relatively few other 
undesirable structural attributes. Thus, the optimized nucleic acid product has an 
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improved level of expression due to improved codon usage frequency, and a 
reduced risk of inappropriate transcriptional behavior due to a reduced number 
of undesirable transcription regulatory sequences. 

An isolated nucleic acid molecule of the invention which is optimized 
5 may have a codon composition that differs from that of the corresponding wild- 
type nucleic acid sequence at more than 30%, 35%, 40% or more than 45%, e.g., 
50%, 55%, 60% or more of the codons. Preferred codons for use in the 
invention are those which are employed more frequently than at least one other 
codon for the same amino acid in a particular organism and, more preferably, are 
1 0 also not low-usage codons in that organism and are not low-usage codons in the 
organism used to clone or screen for the expression of the nucleic acid molecule. 
Moreover, preferred codons for certain amino acids (i.e., those amino acids that 
have three or more codons,), may include two or more codons that are employed 
more frequently than the other (non-preferred) codon(s). The presence of 

15 codons in the nucleic acid molecule that are employed more frequently in one 
organism than in another organism results in a nucleic acid molecule which, 
when introduced into the cells of the organism that employs those codons more 
frequently, is expressed in those cells at a level that is greater than the expression 
of the wild-type or parent nucleic acid sequence in those cells. 

20 In one embodiment of the invention, the codons that are different are 

those employed more frequently in a mammal, while in another embodiment the 
codons that are different are those employed more frequently in a plant. A 
particular type of mammal, e.g., human, may have a different set of preferred 
codons than another type of mammal. Likewise, a particular type of plant may 

25 have a different set of preferred codons than another type of plant. In one 

embodiment of the invention, the majority of the codons which differ are ones 
that are preferred codons in a desired host cell. Preferred codons for mammals 
(e.g., humans) and plants are known to the art (e.g., Wada et al., 1990). For 
example, preferred human codons include, but are not limited to, CGC (Arg), 

30 CTG (Leu), TCT (Ser), AGC (Ser), ACC (Thr), CCA (Pro), CCT (Pro), GCC 
(Ala), GGC (Gly), GTG (Val), ATC (He), ATT (He), AAG (Lys), AAC (Asn), 
CAG (Ghi), CAC (His), GAG (Glu), GAC (Asp), TAG (Tyr), TGC (Cys) and 
TTC (Phe) (Wada et al, 1990). Thus, in one embodiment, synthetic nucleic acid 
molecules of the invention have a codon composition which differs from a wild 



24 



wo 2004/025264 



PCTAJS20D3/028939 



type nucleic acid sequence by having an increased number of the preferred 
human codons, e.g. CGC, CTG, TCT, AGC, ACC, CCA, CCT, GCC, GGC, 
GTG, ATC, ATT, AAG, AAC, CAG, CAC, GAG, GAC, TAG, TGC, TTC, or 
any combination thereof For example, the nucleic acid molecule of the 

5 invention may have an increased number of CTG or TTG leucine-encoding 
codons, GTG or GTC vahne-encoding codons, GGC or GGT glycine-encoding 
codons, ATC or ATT isoleucine-encoding codons, CCA or CCT proline- 
encoding codons, CGC or CGT arginine-encoding codons, AGC or TCT serine- 
encoding codons, ACC or ACT threonine-encoding codon, GCC or GCT 

10 alanine-encoding codons, or any combination thereof^ relative to the wild-type 
nucleic acid sequence. Similarly, nucleic acid molecules having an increased 
number of codons that are employed more frequently in plants, have a codon 
composition which differs from a wild-type or parent nucleic acid sequence by 
having an increased number of the plant codons including, but not limited to, 

1 5 CGC (Arg), CTT (Leu), TCT (Ser), TCC (Ser), ACC (Thr), CCA (Pro), CCT 
(Pro), GCT (Ser), GGA (Gly), GTG (Val), ATC (He), ATT Qlo), AAG (Lys), 
AAC (Asn), CAA (Gin), CAC (His), GAG (Glu), GAC (Asp), TAC (Tyr), TGC 
(Cys), TTC (Phe), or any combmation thereof (Munay et al, 1989). Preferred 
codons may differ for different types of plants (Wada et al, 1990). 

20 A nucleic acid molecule comprising a nucleic acid sequence encoding a 

frision polypeptide of the invention is optionally operably linked to transcription 
regulatory sequences, e.g., enhancers, promoters and transcription termination 
sequences to form an expression cassette. The nucleic acid molecule is 
introduced to a vector, e.g., a plasmid or viral vector, which optionally a , 

25 selectable marker gene, and the vector introduced to a cell of interest, for 

example, a plant (dicot or monocot), fimgus, yeast or mammaUan cell. Preferred 
host cells are mammahan cells such as CHO, COS, 293, Hela, CV-1, and 
NIH3T3 cells. 

The expression of the encoded ftision polypeptide may be controlled by 
30 any promoter, includiag but not limited to regulatable promoters, e.g., an 

inducible or repressible promoter such as the tet promoter, the hsp70 promoter 
and a synthetic promoter regulated by CRE. For example, in the tet-regulated 
system, the luminescent signal for a wild-type luciferase dissipated by 16-17 
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hours while the signal for a fusion polypeptide comprising a heterologous 
destabilization sequence dissipated to a similar level by 4 hours. 

In one embodiment of the invention, the isolated nucleic acid molecule 
comprises a nucleic acid sequence encoding a fusion polypeptide comprising a 
5 reporter protein and at least two destabilization sequences, wherein the nucleic 
acid sequence is a synthetic sequence containing codons preferentially found in a 
particular organism, e.g., in plants or humans, and more preferably in highly 
expressed proteins, for instance, highly expressed human proteins. 

In one preferred embodiment, the invention provides an isolated nucleic 
10 acid molecule comprising a nucleic acid sequence encoding a fusion polypeptide 
comprising a luminescent protein, e.g., a luciferase, and a combination of 
heterologous protein and/or mRNA destabiUzation sequences. Preferably, at 
least the sequence encoding the luminescent protein is optimized for expression 
in human cells. 

15 The invention will be further described by the following non-limiting 

example. 

Example 1 

Materials and Methods 

20 Bacterial Cells and Plasmids 

Escherichia coli JM109 cells were used to propagate plasmids. Bacterial 
cultures were grown routinely in LB broth at 37°C with the addition of 100 
M-g/ml ampicillin or 30 |ag/ml kanamycin when required. Extraction and 
purification of plasmid DNA were perfoimed using Plasmid Maxi Kit (Qiagen). 

25 pGEM®T Easy Vector (Promega) was used to clone PGR products, 

Plasmids pGL3-Basic Vector and pSP-luc+NF Fusion Vector were used as the 
source for cDNA encoding firefly luciferase (Promega). 
DNA modifying enzymes 

Restriction enzymes Agel, Apal, BamKI, BglU, BstEE, Bstm, EcoRI, 

30 EcoKY, Ncol, Notl, Seal, Xbal and JTwwI as well as T4 DNA polymerase and SI 
nuclease were obtained firom Promega. Rapid DNA Ligation Kit and Expand™ 
High FideUty PGR System were supplied by Boehringer Mannheim. 
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Oligonucleotides 

Oligonucleotides used for polymerase chain reactions as well as 
oligonucleotides used for cloning and sequencing are listed in Table 1. All the 
oligonucleotides were synthesized at Promega. 
5 A representative number of these DNA constructs are depicted 

schematically in Figure 1 1 . 

pGEM-Luc5 was constructed by cloning into pGEM*^ Easy Vector a JBragment 

that encodes &efly luciferase, which was amplified firom pGL3-Basic 

Vector using primers LucN and LucC (Table 1). 
10 pLucl 1 was constructed by joining the large Notl-BglR fi-agment of plasmid 

pUbiqGFP23 with the smsll Bgia-Notl firagment of plasmid pGEM-Luc5. 
pETwtLucl is a derivative of plasmid pET28b(+) that contains the small Ncol- 

EcoRY fi-agment of plasmid pSP-luc+NF Fusion Vector instead of the 

Ncol-Ecinen fragment of plasmid pET28b(+). 
15 pwtLucl was generated by joining the large Notl-Scal fragment of plasmid 

pLucl 1 with the small Scal-NotI fragment of plasmid pETwtLucl . 
pLuc-PESTlO was generated by joining the large Notl-EcoRl fragment of 

plasmid pLucl 1 with the synthetic DNA fragment that encodes a mutant 

C-terminal region of the mouse ornithine decarboxylase (mODC), which 
20 synthetic fragment was formed by oligonucleotides PEST-5', PEST-3*, 5 - 

PEST and 3*-PEST (Table 1). 
pTVLucPESTlO was generated by joining the large BglH-Scal fragment of 

plasmid pLuc-PESTlO with the small Scal-BgHl fragment of plasmid 

pETwtLucl. 

25 pLucARIl 7 was constructed by treatment of plasmid pLucl 1 with EcdSl, T4 

DNA polymerase and ligase. 
pSPUbiqLucl was generated by combining ^^ffill-linearized DNA of plasmid 

pSP-luc+NF Fusion Vector with the fragment of plasmid pUbiqGFP23 

which encodes ubiquitin. The ubiquitin fragment was prepared using PGR 
30 and primers Ubiquitin 5* ^BsytEH and Ubiquitin 3' v^lBstEH (Table 1), 

and subsequent treatment with BstEH. 
pETUbiqLuc was constructed by joining the large iVcoI-£'c/l 3611 fragment of 

plasmid pETBirA with the small Ncol-EcoKV fragment of plasmid 

pSPUbiqLucl. 
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pUbiqLuclS was prepared by joining the large fragment of plasmid 

pUbiqGFP23, which was generated by treatment of pUbiqGFP23 with 
Agel, SI nuclease md Xbal, with the small fragment of plasmid pGL3 
Basic Vector, which was generated by treatment of pGLS-Basic Vector 
5 with Ncol, S 1 nuclease and Xbal. 

pUbiq(Y)Lucl9 was generated by combining the IdigQ Xbal-Xmnl fragment of 
plasmid pUbiqGFP23 with ttie small Xbal-Xmnl fragment of plasmid 
pSPUbiqLucl. 

pT7Ubiq(I)LuGl9.1, pT7l]biq(E)Lucl9.1 andpT7Ubiq(M)Lucl9.2 were 

10 generated by combining the large BamHL-Apal fragment of plasmid 

pUbiq(Y)Lucl9 with BaniHl-Apal treated DNA fragments which had been 
ampUfied by PGR using plasmid pUbiqLuclS and primers Ubi-Luc 5' 
w/Linker and Ubi-Luc 3* with linker or Ubi-Luc3' w/Linker Glu or Ubi- 
Luc 3' w/Linker Met, accordingly (Table 1). 

15 pT7Ubiq(Y)Lucl9.2 was generated by joining the large BamBl-XmnI fragment 
of plasmid pT7Ubiq(I)Lucl9.1 with the small BamUl-Xmnl fragment of 
plasmid pUbiq(Y)Lucl9. 
pUbiq(R)Lucl3 was generated by combining the large BstER-Xinnl fragment of 
plasmid pUbiq(Y)Lucl9 with the BstEU-Xmnl treated DNA fragments, 

20 which had been amplified by PGR using plasmid pUbiq(Y)Lucl9 and 

primers Ubiquitin ywt/BsytEU and Ubiq(R) (Table 1). 
pUbiq(A)Luc2, pUbiq(Asp2)Lucl6, pUbiq(F)LuclO, pUbiq(His2)Luc3, 
pUbiq(H)Lucll, pUbiq(L)Luc23, pUbiq(K)Luc4, pUbiq(N)Luc25, 
pUbiq(Q)Luc36 and pUbiq(W)Lucl6 were constructed by combining the 

25 large BstER-Xmnl fragment of plasmid pUbiq(R)Lucl3 with BstEH-Xmnl 

treated DNA fragments which had been amplified by PGR using plasmid 
pUbiq(Y)Lucl9 and primers Ubiquitin yv^BsytEIL and Ala or Asp, or 
Phe, or His2, or His, or Leu, or Lys, or Asn, or Ghi, or Trp, respectively 
(Table 1). 

30 pUbiq(H)ALucl 8 was constructed by treatment of plasmid pUbiq(H)Lucl 1 with 
BstBUy T4 DNA polymerase and ligase. 
pUbiq(E)ALuc6 was generated by joining the large Scal-Xmnl fragment of 
plasmid pUbiq(H)ALucl 8 with a Scal-Xmnl treated PGR amplified 
fragments. The fragments were amphfied from plasmid 
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pT7Ubiq(E)Lucl9.1 as separate DNA fragments using primers Ubiquitin 
ywt/BsytEJI and Ubiq(E)del5» or Ubiq(E)del3' and LucC (Table 1) and 
then those fragments were combined in a separate PGR using primers 
Ubiquitin ywt/BsytEll and LucC. 
5 pT7Ubiq(E)LucPEST23 was generated by joining the large Bst9Sl'Scal 

fragment of plasmid pT7Ubiq(I)Lucl9.1 with the small Bst9Sl'Scal' 
fragment of plasmid pLuc-PESTlO. 
pUbiq(R)Luc-PEST12 and pUbiq(Y)Luc-PEST5 were generated by joining the 
small Bst981''ScaI fragment of plasmid pLuc-PESTlO with the large 
10 Bsmi-Scal fragment of plasmids pUbiq(R)Lucl3 and pUbiq(Y)Lucl9, 

accordingly. 

pGEMhLuc+5 was constructed by cloning into pGEM®"T Easy Vector a 

fragment that encodes firefly luciferase, which fragment was amplified 
using a template with an optimized firefly luciferase sequence and primers 
15 Luc+N and Luc+C (Table 1). 

phLuc+PESTl was generated by joining the small EcdRI-HindUl fragment of 
plasmid pGEMhLuc+5 with the large EcoRI-Hindni fragment of plasmid 
pLuc-PESTlO. 

pT7Ubiq(E)hLuc+PEST80 was generated by joining the small BstBll-Vspl 
20 fragment of plasmid pT7Ubiq(I)Lucl9.1 with the large BstEU-Vspl 

fragment of plasmid phLuc+PESTI . 

A sequence containing the promoter of the human hsp70 gene (Phsp7o) was 
amplified from human chromosomal DNA using PGR and primers 5'- 
ATTAATCTGATCAATAAAGGGTTTAAGG (SEQ ID NO:l) and 5'- 
25 AAAAAGGTAGTGGACTGTCG (SEQ ID NO:2). 

A UTR destabilization sequence was assembled using primers: 5'- 

CTAGATTTATTTATTTATTTCTTCATATGC (SEQ ID NO:3) and 5'- 
AATTGCATATGAAGAAATAAATAAATAAAT (SEQ ID N0:4). 

A BKB destabihzation sequence was assembled using primers: 5'- 
30 AATTGGGAATTAAAACAGCATTGAACCAAGAAGCTTGGCTTTCT 
TATCAATTCTTTGTGACATAATAAGTT (SEQ ID NO:5) and 5'- 
AACTTATTATGTCACAAAGAATTGATAAGAAAGCCAAGCTTCTT 
GGTTCAATGCTGTTTTAATTCCC (SEQ ID N0:6). 
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A mutant mODC PEST sequence 

(HGFPPEMEEQAAGTLPMSCAQESGMDRHPAACASARINV 
(corresponding to resides 423-461 of mODC; SEQ ID N0:7) was 
assembled using primers: 5'- 
5 AATTCTCATGGCTTCCCGCCGGAGATGGAGGAGCAGGCTGCTGG 
CACGCTGCCCATGTCTT (SEQ ED N0:8), 5'- 

GTGCCCAGGAGAGCGGGATGGACCGTCACCCTGCAGCCTGTGCT 

TCTGCTAGGATCAATGTGTAA (SEQ ID N0:9), 5'- 

GGCCTTACACATTGATCCTAGCAGAAGCACAGGCTGCAGGGTG 
10 ACGGTCCATCCCGCTCTCCT (SEQ ID NO:10) and 5'- 

GGGCACAAGACATGGGCAGCGTGCCAGCAGCCTGCTCCTCCATC 

TCCGGCGGGAAGCCATGAG (SEQ ID N0:1 1). 
A CLl sequence (ACKNWFSSLSHFVIHL; SEQ ID NO: 12) was assembled 

using oligonucleotides: 5'- 
1 5 AATTCAAGTGGATCACGAAGTGGCTCAAGCTGCTGAACCAGTTC 

TTGCAGGCAGACA (SEQ ID N0:13) and 5'- 

AATTTGTCTGCCTGCAAGAACTGGTTCAGCAGCTTGAGCCACTT 
CGTGATCCACTTG (SEQ ID NO: 14). 
An optimized PEST sequence (hPEST) has the following sequence: 
20 CACGGCTTCCCtCCCGAGGTGGAGGAGCAGGCCGCCGGCACCCT 
GCCCATGAGCTGCGCCCAGGAGAGCGGCATGGAtaGaCACCaGCt 
. GCtTGCGCCAGCGCCAGGATCAACGTcTAA (SEQ ID NO: 1 5). 
An optimized CLl and hPEST with a UTR sequence has the following 
sequence: 

25 GCtTGCAAGAACTGGTTCAGtAGCtTaAGCCACTTtGTGATCCACCT 
tAACAGCCACGGCTTCCCtCCCGAGGTGGAGGAGCAGGCCGCCG 
GCACCCTGCCCATGAGCTGCGCCCAGGAGAGCGGCATGGAtaGaC 
ACCQGCtGCtTGCGCCAGCGCCAGGATCAACGTcTAg (SEQ ID 
NO:46). 

30 pGEM-hRL3 was constructed by cloning into pGEM®T Easy Vector a PGR 

amplified optimized sequence that encodes Renilla luciferase, which was 
amplified firom phRL-TK using primers hRLN and hRLC (Table 1). 
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phRL-PEST15 was generated by joining the large HindBI-EcdRI fragment of 
plasmid pLuc-PESTlO with the small Hindni-EcoRI fragment of plasmid 
pGEM-hRL3. 

phRLARI-PESTl was constructed by treatment of plasmid phRL-PEST15 with 
5 EcoRI, T4 DNA polymerase and Ugase. 

pT7lJbiq(E)hRL-PEST65 and pUbiq(R)hRL-PEST45 were generated by joining 
the large BstEU-VspI fragment of plasmid phRL-PEST15 with the small 
BstESI-VspI fragment of plasmids pT7Ubiq(E)Lucl9.1 and 
pUbiq(R)Luc 1 3 , accordingly. 
1 0 pUbiq(A)hRLl , pUbiq(H)hRLl , pUbiq(F)hRLl were generated by joining the 
large Bst98l-BstEIL fragment of plasmid phRLARI-PESTl with the small 
Bst9SI''BstEIl fragment of plasmids pUbiq(A)Luc2 or pUbiq(His2)Luc3 or 
pUbiq(F)LuclO, accordingly. 
pUbiq(E)hRLl and pUbiq(R)hRLl were created by joining the large HindUl' 
EcoRV fragment of plasmid phRLARI-PESTl with the small Hindni- 
EcoRY fragment of plasmids pT7Ubiq(E)hRL-PEST65 or pUbiq(R)hRL- 
PEST45, accordingly. 
pGEM-tetOl was constructed by cloning into pGEM®T Easy Vector a PGR 
amphfied sequence that encodes a hCMV minimal promoter with 
heptamerized upstream tet-operators (Gossen, 1992), which was amphfied 
frompUHD 10-3 using primers tetO-3' andtetO-5' (Table 1). 
ptetO-hRL9 was generated by treatment of plasmid ptetO-hRLl-PESTl with 

endonuclease EcoRI, T4 DNA polymerase and hgase. 
ptetO-hRL-PESTl was generated by joining the large Nhel-Vspl fragment of 
plasmid phRL-PEST15 with the small Nhel-Vspl fragment of plasmid 
pGEM-tetOl. 

ptetO-T7Ubiq(E)hRL-PEST15 was generated by joining the large Nliel-Vspl 
fragment of plasmid pT7Ubiq(E)hRL-PEST65 with the small Nhel-Vspl 
fragment of plasmid pGEM-tetO 1 . 
ptetO-Ubiq(E)hRL-PEST6 was constructed by treatment of plasmid ptetO- 

T7Ubiq(E)hRL-PEST15 with Xbal and EcoAlJH, T4 DNA polymerase and 
ligase. 
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ptetO-Ubiq(E)hRL-PEST-UTR13 was created by joining the large Muni-Xbal 
fragment of plasmid ptetO-Ubiq(E)hRL-PEST6 with the adaptor formed 
by oligonucleotides AUUU and anti-AUUU (Table 1). 

ptetO-hRL-PEST-UTR12 was created by joining the large Pstl-Kpnl jfragment of 
5 plasmid ptetO-XJbiq(E)hRL-PEST-UTR13 with the small Pstl-Kpnl 

fragment of plasmid ptetO-hRL-CLl-PESTl 1. 

ptetO-Ubiq(E)hRL-PEST-BKB24 was created by joining the laige Muni-Hpal 
fragment of plasmid ptetO-T7Ubiq(E)hRL-PEST15 with the adaptor 
formed by oligonucleotides 3'-BKBl and 5'-BKBlrev (Table 1). 
1 0 ptetO-T7Ubiq(E)hRL-PEST-UTR-BKB8 was generated by joining the large 

Nhel-Muni fragment of plasmid ptetO-Ubiq(E)hRL-PEST-BKB24 with the 
small Nliel-Muni fragment of plasmid ptetO-Ubiq(E)hRL-PEST-UTR16. 

ptetO-Ubiq(E)hRL-PEST-UTR16 was generated by joining the large Munl-Xbal 
fragment of plasmid ptetO-Ubiq(E)hRL-PEST6 with the DNA fragment 
1 5 formed by oligonucleotides AUUU (SEQ ID N0:3) and Anti-AUUU (SEQ 

IDN0:4). 

ptetO-hRL-CLl-PESTll was generated by joining the large EcoRl fragment of 
plasmid ptetO-hRL-PESTl with the DNA fragment formed by 
oligonucleotides CLl-N-final (SEQ E) NO:64) and Rev-CLl-N-final 
20 (SEQIDNO:65). 

ptetO-hRL-CLl-PEST-UTRl was generated by joining the large Pstl-Kpnl 

fragment of plasmid ptetO-Ubiq(E)hRL-PEST-UTRl 6 with the small Pstl- 
Kpnl fragment of plasmid ptetO-hRL-CLl-PESTll. 
pGEM-Phsp70-3 was constructed by cloning into pGEM®T Easy Vector a PGR 
25 amplified sequence which was amplified from human DNA using primers 

hsp70-5' andhsp70-3' (Table 1). 
pPhsp70-hRL-PEST15 was generated by joining the large Nhel-Vspl fragment of 
plasmid ptetO-hRL-PESTl with the small Nhel-Vspl fragment of plasmid 
pGEM-Phsp70-3. 

30 pPhsp70-hRL7 was constructed by treatment of plasmid pPhsp70-hRL-PESTl 5 
with EcoRl, T4 DNA polymerase and hgase. 
pPhsp70-Ubiq(E)hRL-PBSTl was generated by joining the large Nliel-Vspl 
fragment of plasmid ptetO^Ubiq(E)hRL-PEST6 with the small Nhel-Vspl 
fragment of plasmid pGEM-Phsp70-3. 
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pPhsp70-Ubiq(E)hRL-PEST-UTR10 was generated by joining the large Nhel- 
Vspl fragment of plasmid ptetO-Ubiq(E)hRL-PEST-UTR16 with the small 
NJiel-Vspl fragment of plasmid pGEM-Phsp70-3. 
pPhsp70-T7lJbiq(E)hRL-PEST-BKB5 was generated by joining the large Nhel- 
5 Vspl fragment of plasmid ptetO-T7Ubiq(E)hRL-PEST-BKB24 with the 

small Nhel'Vspl fragment of plasmid pGEM-Phsp70-3. 
pPhsp70-T7Ubiq(E)hRL-PEST-UTR-BKB7 was generated by joining the large 
Nhel-Vspl fragment of plasmid ptetO-T7Ubiq(E)hRL-PEST-UTR-BKB 8 
with the small NJiel-Vspl fragment of plasmid pGEM-Phsp70-3. 
10 pLucCLl-25 was generated by joining the large EcoRI-Notl fragment of plasmid 
pLucl 1 with the DNA fragment formed by oligonucleotides CLl (SEQ ID 
NO:62) and Rev-CLl (SEQ ID NO:63). 
pLucCLl-PEST9 was generated by joining the large EcoRI fragment of plasmid 
pLuc-PESTlO with the DNA fragment formed by oUgonucleotides CLl-N- 
15 fmal (SEQ ID NO:64) and Rev-CLl -N-final (SEQ ID NO:65). 

pCLl-Lucl was generated by joining the large Hindni-Bglll fragment of 

plasmid pLucARI17 with the DNA fragment formed by oligonucleotides 
CLl-N (SEQ ID NO:64) and Rev-CLl-N (SEQ ID NO:65). 
phLuc+CLl-PEST13 was generated by joining the large EcoRl fragment of 
20 plasmid phLuc+PESTl with the DNA fragment formed by 

oligonucleotides CLl-N-final (SEQ ID NO:64) and Rev-CLl -N-final 
(SEQ ID NO:65). 

pPhsp70hLuc+PEST2 was generated by joining the large EcoRl-Nhel fragment 
of plasmid phLuc+PESTl with the small EcoRI-Nhel fragment of plasmid 
25 pPhsp70hRL-PEST15. 

pPhsp70hLuc+14 was constructed by treatment of plasmid 

pPhsp70hLuc+PEST2 with EcoRI, T4 DNA polymerase and ligase. 

pPhsp70hRL-CLl-PEST-UTR4 was generated by joining the large Vspl-Nhel 
fragment of plasmid ptetO-hRL-CLl-PEST-UTRl with the small Vspl- 
30 Nhel fragment of plasmid pGEM-Phsp70-3. 

pPhsp70hLuc+CLl-PEST12, pPhsp70hLuc4-CLl-PEST-UTR5 were generated 
by joining the small EcoKL-Nhel fragment of plasmid phLuc+CLl-PEST13 
with large EcoRI-NIiel fragments of plasmids pPhsp70hRL-PESTl 5 and 
pPhsp70hRL-CLl-PEST-UTR4, respectively. 
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pPhsp70MhLuc+27, pPhsp70MhLuc4-PEST25, pPhsp70MhLuc+CLl-PEST32 
and pPhsp70MhLuc+CLl-PEST-UTRl9 were constructed by cloning 
DNA fragment formed by oligonucleotides N-M and M-C (Table 1) into 
plasmids pPhsp70hLuc+14, pPhsp70hLuc+PEST2, pPhsp70hLuc+CLl- 
5 PEST12 and pPhsp70hLuc+CLl-PEST-UTR5, respectively, that were 

treated with BstEU and BglR, 

phRL-PEST14 was constructed by joining the large EcoRV-Nhel fragment of 
plasmid phRL-PEST15 with the small EcoRW-Nhel fragment of plasmid 
phRL-TK. 

10 pGL3-hRL-PEST3 was constructed by joining the large Bst98l-Xbal fragment of 

plasmid pGL3-Ubiq(E)hRL-PEST2 with the small Bst9Sl-Xbal fragment 

of plasmid phRL-PEST14. 

pGL3-hRLl 1 was constructed by joining the large Bst9Sl-EcoRY fragment of 

plasmid pGL3-hRL-PEST3 with the small Bsmi-EcoRV fragment of 

15 plasmid phRL3. 

pGL3-hRL-CLl-PEST-UTR23 was constructed by joining the large Bsmi- 

EcoRY fragment of plasmid pGL3-hRL-PEST3 with the small ^8^^981- 

EcoRY fragment of plasmid ptetO-hRL-CLl-PEST-UTRl. 

pGL3-hRL-PEST-UTR6 was constructed by joining the large Bst9Sl-EcoRY 

20 fragment of plasmid pGL3-hRL-PEST3 with the small Bst98l-EcoRY 

fragment of plasmid ptetO-Ubiq(E)hRL-PEST-UTR16. 

pGL3-hRL-CLl-PEST7 was constructed by joining the large Bsmi-EcoRY 

fragment of plasmid pGL3-hRL-PEST3 with the small Bst9Sl-EcoRY 

fragment of plasmid ptetO-hRL-CLl -PESTl 1 . 

25 An optimized Renilla luciferase DNA has the following sequence; 

atggcttccaaggtgtacgaccccgagcaacgcaaacgcatgatcactgggcctcagtggtgggctcgctgcaag 
c 

aaatgaacgtgctggactccttcatcaactactatgattccgagaagcacgccgagaacgccgtgatttttctgca 
tggtaacgctgcctccagctacctgtggaggcacgtcgtgcctcacatcgagcccgtggctagatgcatcatccct 
30 gatctgatcggaatgggtaagtccggcaagagcgggaatggctcatatcgcctcctggatcactacaagtacctca 
ccgcttggttcgagctgctgaaccttccaaagaaaatcatctttgtgggccacgactggggggcttgtctggcctt 
tcactactcctacgagcaccaagacaagatcaaggccatcgtccatgctgagagtgtcgtggacgtgatcgagtcc 
tgggacgagtggcctgacatcgaggaggatatcgccctgatcaagagcgaagagggcgagaaaatggtgcttga 
ga 

35 ataacttcttcgtcgagaccatgctcccaagcaagatcatgcggaaactggagcctgaggagttcgctgcctacct 
ggagccattcaaggagaagggcgaggttagacggcctaccctctcctggcctcgcgagatccctctcgttaaggg 
a 

ggcaagcccgacgtcgtccagattgtccgcaactacaacgcctaccttcgggccagcgacgatctgcctaagatgt 
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tcatcgagtccgaccctgggttcttttccaacgctattgtcgagggagctaagaagttccctaacaccgagttcgt 
gaaggtgaagggcctccacttcagccaggaggacgctccagatgaaatgggtaagtacatcaagagcttcgtgga 

g 

cgcgtgctgaagaacgagcagtaa (SEQ ID NO:47). 

An optimized firefly luciferase DNA has the following sequence: 

atggccgatgctaagaacattaagaagggccctgctcccttctaccctctggaggatggcaccgctggcgagcag 
c 

tgcacaaggccatgaagaggtatgccctggtgcctggcaccattgccttcaccgatgcccacattgaggtggacat 
cacctatgccgagtacttcgagatgtctgtgcgcctggccgaggccatgaagaggtacggcctgaacaccaacca 
c 

cgcatcgtggtgtgctctgagaactctctgcagttcttcatgccagtgctgggcgccctgttcatcggagtggccg 

tggcccctgctaacgacatttacaacgagcgcgagctgctgaacagcatgggcatttctcagcctaccgtggtgtt 

cgtgtctaagaagggcctgcagaagatcctgaacgtgcagaagaagctgcctatcatccagaagatcatcatcatg 

gactctaagaccgactaccagggcttccagagcatgtacacattcgtgacatctcatctgcctcctggcttcaacg 

agtacgacttcgtgccagagtctttcgacagggacaaaaccattgccctgatcatgaacagctctgggtctaccgg 

cctgcctaagggcgtggccctgcctcatcgcaccgcctgtgtgcgcttctctcacgcccgcgaccctattttcggc 

aaccagatcatccccgacaccgctattctgagcgtggtgccattccaccacggcttcggcatgttcaccaccctgg 

gctacctgatttgcggctttcgggtggtgctgatgtaccgcttcgaggaggagctgttcctgcgcagcctgcaaga 

ctacaaaattcagtctgccctgctggtgccaaccctgttcagcttcttcgctaagagcaccctgatcgacaagtac 

gacctgtctaacctgcacgagattgcctctggcggcgccccactgtctaaggaggtgggcgaagccgtggccaa 

gctttcatctgccaggcatccgccagggctacggcctgaccgagacaaccagcgccattctgattaccccagagg 
g 

cgacgacaagcctggcgccgtgggcaaggtggtgccattcttcgaggccaaggtggtggacctggacaccggc 
aag 

accctgggagtgaaccagcgcggcgagctgtgtgtgcgcggccctatgattatgtccggctacgtgaataaccctg 

aggccacaaacgccctgatcgacaaggacggctggctgcactctggcgacattgcctactgggacgaggacga 

gca 

cttcttcatcgtggaccgcctgaagtctctgatcaagtacaagggctaccaggtggccccagccgagctggagtct 

atcctgctgcagcaccctaacattttcgacgccggagtggccggcctgcccgacgacgatgccggcgagctgcct 

g 

ccgccgtcgtcgtgctggaacacggcaagaccatgaccgagaaggagatcgtggactatgtggccagccaggtg 
ac 

aaccgccaagaagctgcgcggcggagtggtgttcgtggacgaggtgcccaagggcctgaccggcaagctgga 
cgcccgcaagatccgcgagatcctgatcaaggctaagaaaggcggcaagatcgccgtgtaa (SEQ ID 
NO:48). 

An optimized mutant firefly luciferase DNA has the following sequence: 

atggccgatgctaagaacattaagaagggccctgctcccttctaccctctggaggatggcaccgctggcgagcag 
c 

tgcacaaggccatgaagaggtatgccctggtgcctggcaccattgccttcaccgatgcccacattgaggtggacat 
cacctatgccgagtacttcgagatgtctgtgcgcctggccgaggccatgaagaggtacggcctgaacaccaacca 
c 

cgcatcgtggtgtgctctgagaactctctgcagttcttcatgccagtgctgggcgccctgttcatcggagtggccg 

tggcccctgctaacgacatttacaacgagcgcgagctgctgaacagcatgggcatttctcagcctaccgtggtgtt 

cgtgtctaagaagggcctgcagaagatcctgaacgtgcagaagaagctgcctatcatccagaagatcatcatcatg 

gactctaagaccgactaccagggcttccagagcatgtacacattcgtgacatctcatctgcctcctggcttcaacg 

agtacgacttcgtgccagagtctttcgacagggacaaaaccattgccctgatcatgaacagctctgggtctaccgg 
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cctgcctaagggcgtggccctgacccatcgcaacgcctgtgtgcgcttctctcacgcccgcgaccctattttcggc 
aaccagatcatccccgacaccgctattctgagcgtggtgccattccaccacggcttcggcatgttcaccaccctgg 
gctacctgatttgcggctttcgggtggtgctgatgtaccgcttcgaggaggagctgttcctgcgcagcctgcaaga 
ctacaaaattcagtctgccctgctggtgccaaccctgttcagcttcttcgctaagagcaccctgatcgacaagtac 
5 gacctgtctaacctgcacgagattgcctctggcggcgccccactgtctaaggaggtgggcgaagccgtggccaa 
gc 

gctttcatctgccaggcatccgccagggctacggcctgaccgagacaaccagcgccattctgattaccccagagg 
g 

cgacgacaagcctggcgccgtgggcaaggtggtgccattcttcgaggccaaggtggtggacctggacaccggc 
10 aag 

accctgggagtgaaccagcgcggcgagctgtgtgtgcgcggccctatgattatgtccggctacgtgaataaccctg 

aggccacaaacgccctgatcgacaaggacggctggctgcactctggcgacattgcctactgggacgaggacga 

gca 

cttcttcatcgtggaccgcctgaagtctctgatcaagtacaagggctaccaggtggccccagccgagctggagtct 
15 atcctgctgcagcaccctaacattttcgacgccggagtggccggcctgcccgacgacgatgccggcgagctgcct 
g 

ccgccgtcgtcgtgctggaacacggcaagaccatgaccgagaaggagatcgtggactatgtggccagccaggtg 
ac 

aaccgccaagaagctgcgcggcggagtggtgttcgtggacgaggtgcccaagggcctgaccggcaagctgga 
20 cgcccgcaagatccgcgagatcctgatcaaggctaagaaaggcggcaagatcgccgtgtaa (SEQ ID 
NO:49). 

Other optimized firefly luciferase sequences include hluc+: 

25 ATGGCCGATGCTAAGAACATTAAGAAGGGCCCTGCTCCCTTCTACCC 
TCTGGAGGATGGCACCGCTGGCGAGCAGCTGCACAAGGCCATGAAG 
AGGTATGCCCTGGTGCCTGGCACCATTGCCTTCACCGATGCCCACATT 
GAGGTGGACATCACCTATGCCGAGTACTTCGAGATGTCTGTGCGCCT 
GGCCGAGGCCATGAAGAGGTACGGCCTGAACACCAACCACCGCATC 

30 GTGGTGTGCTCTGAGAACTCTCTGCAGTTCTTCATGCCAGTGCTGGGC 
GCCCTGTTCATCGGAGTGGCCGTGGCCCCTGCTAACGACATTTACAA 
CGAGCGCGAGCTGCTGAACAGCATGGGCATTTCTCAGCCTACCGTGG 
TGTTCGTGTCTAAGAAGGGCCTGCAGAAGATCCTGAACGTGCAGAAG 
AAGCTGCCTATCATCCAGAAGATCATCATCATGGACTCTAAGACCGA 

35 CTACCAGGGCTTCCAGAGCATGTACACATTCGTGACATCTCATCTGCC 
TCCTGGCTTCAACGAGTACGACTTCGTGCCAGAGTCTTTCGACAGGG 
ACAAAACCATTGCCCTGATCATGAACAGCTCTGGGTCTACCGGCCTG 
CCTAAGGGCGTGGCCCTGCCCCATCGCACCGCCTGTGTGCGCTTCTCT 
CACGCCCGCGACCCTATTTTCGGCAACCAGATCATCCCCGACACCGC 

40 TATTCTGAGCGTGGTGCCATTCCACCACGGCTTCGGCATGTTCACCAC 
CCTGGGCTACCTGATTTGCGGCTTTCGGGTGGTGCTGATGTACCGCTT 
CGAGGAGGAGCTGTTCCTGCGCAGCCTGCAAGACTACAAAATTCAGT 
CTGCCCTGCTGGTGCCAACCCTGTTCAGCTTCTTCGCTAAGAGCACCC 
TGATCGACAAGTACGACCTGTCTAACCTGCACGAGATTGCCTCTGGC 

45 GGCGCCCCACTGTCTAAGGAGGTGGGCGAAGCCGTGGCCAAGCGCTT 
TCATCTGCCAGGCATCCGCCAGGGCTACGGCCTGACCGAGACAACCA 
GCGCCATTCTGATTACCCCAGAGGGCGACGACAAGCCTGGCGCCGTG 
GGCAAGGTGGTGCCATTCTTCGAGGCCAAGGTGGTGGACCTGGACAC 
CGGCAAGACCCTGGGAGTGAACCAGCGCGGCGAGCTGTGTGTGCGC 

50 GGCCCTATGATTATGTCCGGCTACGTGAATAACCCTGAGGCCACAAA 
CGCCCTGATCGACAAGGACGGCTGGCTGCACTCTGGCGACATTGCCT 
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ACTGGGACGAGGACGAGCACTTCTTCATCGTGGACCGCCTGAAGTCT 
CTGATCAAGTACAAGGGCTACCAGGTGGCCCCAGCCGAGCTGGAGTC 
TATCCTGCTGCAGCACCCTAACATTTTCGACGCCGGAGTGGCCGGCCT 
GCCCGACGACGATGCCGGCGAGCTGCCTGCCGCCGTCGTCGTGCTGG 
5 AACACGGCAAGACCATGACCGAGAAGGAGATCGTGGACTATGTGGC 
CAGCCAGGTGACAACCGCCAAGAAGCTGCGCGGCGGAGTGGTGTTC 
GTGGACGAGGTGCCCAAGGGCCTGACCGGCAAGCTGGACGCCCGCA 
AGATCCGCGAGATCCTGATCAAGGCTAAGAAAGGCGGCAAGATCGC 
CGTGTAA (SEQ ID NO: 66); 

10 

and hLuc+ (5F2): 

ATGGCCGATGCTAAGAACATTAAGAAGGGCCCTGCTCCCTTCTACCC 
TCTGGAGGATGGCACCGCTGGCGAGCAGCTGCACAAGGCCATGAAG 

15 AGGTATGCCCTGGTGCCTGGCACCATTGCCTTCACCGATGCCCACATT 
GAGGTGGACATCACCTATGCCGAGTACTTCGAGATGTCTGTGCGCCT 
GGCCGAGGCCATGAAGAGOTACGGCCTGAACACCAACCACCGCATC 
GTGGTGTGCTCTGAGAACTCTCTGCAGTTCTTCATGCCAGTGCTGGGC 
GCCCTGTTCATCGGAGTGGCCGTGGCCCCTGCTAACGACATTTACAA 

20 CGAGCGCGAGCTGCTGAACAGCATGGGCATTTCTCAGCCTACCGTGG 
TGTTCGTGTCTAAGAAGGGCCTGCAGAAGATCCTGAACGTGCAGAAG 
AAGCTGCCTATCATCCAGAAGATCATCATCATGGACTCTAAGACCGA 
CTACCAGGGCTTCCAGAGCATGTACACATTCGTGACATCTCATCTGCC 
TCCTGGCTTCAACGAGTACGACTTCGTGCCAGAGTCTTTCGACAGGG 

25 ACAAAACCATTGCCCTGATCATGAACAGCTCTGGGTCTACCGGCCTG 
CCTAAGGGCGTGGCCCTGACCCATCGCAACGCCTGTGTGCGCTTCTCT 
CACGCCCGCGACCCTATTTTCGGCAACCAGATCATCCCCGACACCGC 
TATTCTGAGCGTGGTGCCATTCCACCACGGCTTCGGCATGTTCACCAC 
CCTGGGCTACCTGATTTGCGGCTTTCGGGTGGTGCTGATGTACCGCTT 

30 CGAGGAGGAGCTGTTCCTGCGCAGCCTGCAAGACTACAAAATTCAGT 
CTGCCCTGCTGGTGCCAACCCTGTTCAGCTTCTTCGCTAAGAGCACCC 
TGATCGACAAGTACGACCTGTCTAACCTGCACGAGATTGCCTCTGGC 
GGCGCCCCACTGTCTAAGGAGGTGGGCGAAGCCGTGGCCAAGCGCTT 
TCATCTGCCAGGCATCCGCCAGGGCTACGGCCTGACCGAGACAACCA 

35 GCGCCATTCTGATTACCCCAGAGGGCGACGACAAGCCTGGCGCCGTG 
GGCAAGGTGGTGCCATTCTTCGAGGCCAAGGTGGTGGACCTGGACAC 
CGGCAAGACCCTGGGAGTGAACCAGCGCGGCGAGCTGTGTGTGCGC 
GGCCCTATGATTATGTCCGGCTACGTGAATAACCCTGAGGCCACAAA 
CGCCCTGATCGACAAGGACGGCTGGCTGCACTCTGGCGACATTGCCT 

40 ACTGGGACGAGGACGAGCACTTCTTCATCGTGGACCGCCTGAAGTCT 
CTGATCAAGTACAAGGGCTACCAGGTGGCCCCAGCCGAGCTGGAGTC 
TATCCTGCTGCAGCACCCTAACATTTTCGACGCCGGAGTGGCCGGCCT 
GCCCGACGACGATGCCGGCGAGCTGCCTGCCGCCGTCGTCGTGCTGG 
AACACGGCAAGACCATGACCGAGAAGGAGATCGTGGACTATGTGGC 

45 CAGCCAGGTGACAACCGCCAAGAAGCTGCGCGGCGGAGTGGTGTTC 
GTGGACGAGGTGCCCAAGGGCCTGACCGGCAAGCTGGACGCCCGCA 
AGATCCGCGAGATCCTGATCAAGGCTAAGAAAGGCGGCAAGATCGC 
CGTGTAA (SEQ ID NO: 49). 

50 An optimized GFP sequence has the following sequence: 
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ATGGGCGTGATCAAGCCCGACATGAAGATCAAGCTGCGgATGGAGGG 

CGCCGTGAACGGCCACAAaTTCGTGATCGAGGGCGACGGgAAaGGCA 

AGCCCTTtGAGGGtAAGCAGACtATGGACCTGACCGTGATCGAGGGCG 

CCCCCCTGCCCTTCGCtTAtGACATtCTcACCACCGTGTTCGACTACGGt 

AACCGtGTcTTCGCCAAGTACCCCAAGGACATCCCtGACTACTTCAAGC 

AGACCTTCCCCGAGGGCTACtcgTGGGAGCGaAGCATGACaTACGAGG 

ACCAGGGaATCTGtATCGQACaAACGACATCACCATGATGAAGGGtGT 

GGACGACTGCTTCGTGTACAAaATCCGCTTCGACGGgGTcAACTTCCCt 

GCtAAtGGCCCgGTgATGCAGCGCAAGACCCTaAAGTGGGAGCCCAGtA 

CCGAGAAGATGTACGTGCGgGACGGCGTaCTGAAGGGCGAtGTtAAtAT 

GGCaCTGCTctTGGAGGGaGGCGGCCACTACCGCTGCGACTTCAAGACC 

ACCTACAAaGCCAAGAAGGTGGTGCAGCTtCCCGACTACCACTTCGTG 

GACCACCGCATCGAGATCGTGAGCCACGACAAGGACTACAACAAaGT 

cAAGCTGTACGAGCACGCCGAaGCCCACAGCGGaCTaCCCCGCCAGGC 

CggcTAA (SEQ ID NO: 68). 



An optimized firefly luciferase (liluc+(5f2))-optimized PEST sequence 
(hluc+(5f2)-liPEST) has the following sequence: 

atggccgatgctaagaacattaagaagggccctgctcccttctaccctctggaggatggcaccgctggcgagcag 

ctgcacaaggccatgaagaggtatgccctggtgcctggcaccattgccttcaccgatgcccacattgaggtggaca 

tcacctatgccgagtacttcgagatgtctgtgcgcctggccgaggccatgaagaggtacggcctgaacaccaacc 

accgcatcgtggtgtgctctgagaactctctgcagttcttcatgccagtgctgggcgccctgttcatcggagtggccg 

tggcccctgctaacgacatttacaacgagcgcgagctgctgaacagcatgggcatttctcagcctaccgtggtgttc 

gtgtctaagaagggcctgcagaagatcctgaacgtgcagaagaagctgcctatcatccagaagatcatcatcatgg 

actctaagaccgactaccagggcttccagagcatgtacacattcgtgacatctcatctgcctcctggcttcaacgagt 

acgacttcgtgccagagtctttcgacagggacaaaaccattgccctgatcatgaacagctctgggtctaccggcctg 

cctaagggcgtggccctgacccatcgcaacgcctgtgtgcgcttctctcacgcccgcgaccctattttcggcaacc 

agatcatccccgacaccgctattctgagcgtggtgccattccaccacggcttcggcatgttcaccaccctgggctac 

ctgatttgcggctttcgggtggtgctgatgtaccgcttcgaggaggagctgttcctgcgcagcctgcaagactacaa 

aattcagtctgccctgctggtgccaaccctgttcagcttcttcgctaagagcaccctgatcgacaagtacgacctgtct 

aacctgcacgagattgcctctggcggcgccccactgtctaaggaggtgggcgaagccgtggccaagcgctttcat 

ctgccaggcatccgccagggctacggcctgaccgagacaaccagcgccattctgattaccccagagggcgacg 

acaagcctggcgccgtgggcaaggtggtgccattcttcgaggccaaggtggtggacctggacaccggcaagac 

cctgggagtgaaccagcgcggcgagctgtgtgtgcgcggccctatgattatgtccggctacgtgaataaccctga 

ggccacaaacgccctgatcgacaaggacggctggctgcactctggcgacattgcctactgggacgaggacgag 

cacttcttcatcgtggaccgcctgaagtctctgatcaagtacaagggctaccaggtggccccagccgagctggagt 

ctatcctgctgcagcaccctaacattttcgacgccggagtggccggcctgcccgacgacgatgccggcgagctgc 

ctgccgccgtcgtcgtgctggaacacggcaagaccatgaccgagaaggagatcgtggactatgtggccagccag 

gtgacaaccgccaagaagctgcgcggcggagtggtgttcgtggacgaggtgcccaagggcctgaccggcaag 

ctggacgcccgcaagatccgcgagatcctgatcaaggctaagaaaggcggcaagatcgccgtgaattctCAC 

GGCTTCCCtCCCGAGGTGGAGGAGCAGGCCGCCGGCACCCTGCCCAT 

GAGCTGCGCCCAGGAGAGCGGCATGGAtaGaCACCCtGCtGCtTGCGCC 

AGCGCCAGGATCAACGTcTAA (SEQ ID NO: 69). 

An optimized firefly luciferase(hluc+(5£2))-optimized CLl-optimzed PEST 
sequence (hluc+(5f2)-hCLl-hPEST) has the following sequence: 

atggccgatgctaagaacattaagaagggccctgctcccttctaccctctggaggatggcaccgctggcgagcag 
ctgcacaaggccatgaagaggtatgccctggtgcctggcaccattgccttcaccgatgcccacattgaggtggaca 
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tcacctatgccgagtacttcgagatgtctgtgcgcctggccgaggccatgaagaggtacggcctgaacaccaacc 
accgcatcgtggtgtgctctgagaactctctgcagttcttcatgccagtgctgggcgccctgttcatcggagtggccg 
tggcccctgctaacgacatttacaacgagcgcgagctgctgaacagcatgggcatttctcagcctaccgtggtgttc 
gtgtctaagaagggcctgcagaagatcctgaacgtgcagaagaagctgcctatcatccagaagatcatcatcatgg 
5 actctaagaccgactaccagggcttccagagcatgtacacattcgtgacatctcatctgcctcctggcttcaacgagt 
acgacttcgtgccagagtctttcgacagggacaaaaccattgccctgatcatgaacagctctgggtctaccggcctg 
cctaagggcgtggccctgacccatcgcaacgcctgtgtgcgcttctctcacgcccgcgaccctattttcggcaacc 
agatcatccccgacaccgctattctgagcgtggtgccattccaccacggcttcggcatgttcaccaccctgggctac 
ctgatttgcggctttcgggtggtgctgatgtaccgcttcgaggaggagctgttcctgcgcagcctgcaagactacaa 

10 aattcagtctgccctgctggtgccaaccctgttcagcttcttcgctaagagcaccctgatcgacaagtacgacctgtct 
aacctgcacgagattgcctctggcggcgccccactgtctaaggaggtgggcgaagccgtggccaagcgctttcat 
ctgccaggcatccgccagggctacggcctgaccgagacaaccagcgccattctgattaccccagagggcgacg 
acaagcctggcgccgtgggcaaggtggtgccattcttcgaggccaaggtggtggacctggacaccggcaagac 
cctgggagtgaaccagcgcggcgagctgtgtgtgcgcggccctatgattatgtccggctacgtgaataaccctga 

15 ggccacaaacgccctgatcgacaaggacggctggctgcactctggcgacattgcctactgggacgaggacgag 
cacttcttcatcgtggaccgcctgaagtctctgatcaagtacaagggctaccaggtggccccagccgagctggagt 
ctatcctgctgcagcaccctaacattttcgacgccggagtggccggcctgcccgacgacgatgccggcgagctgc 
ctgccgccgtcgtcgtgctggaacacggcaagaccatgaccgagaaggagatcgtggactatgtggccagccag 
gtgacaaccgccaagaagctgcgcggcggagtggtgttcgtggacgaggtgcccaagggcctgaccggcaag 

20 ctggacgcccgcaagatccgcgagatcctgatcaaggctaagaaaggcggcaagatcgccgtgaattctGCtT 
GCAAGAACTGGTTCAGtAGCtTaAGCCACTTtGTGATCCACCTtAACAGC 
CACGGCTTCCCtCCCGAGGTGGAGGAGCAGGCCGCCGGCACCCTGCC 
CATGAGCTGCGCCCAGGAGAGCGGCATGGAtaGaCACCCtGCtGCtTGCG 
CCAGCGCCAGGATCAACGTcTAg (SEQ ID NO: 70). 

25 

An optimized firefly luciferase (hluc+)-optmiized PEST sequence (hluc+- 
hPEST) has the following sequence: 

atggccgatgctaagaacattaagaagggccctgctcccttctaccctctggaggatggcaccgctggcgagcag 

30 ctgcacaaggccatgaagaggtatgccctggtgcctggcaccattgccttcaccgatgcccacattgaggtggaca 
tcacctatgccgagtacttcgagatgtctgtgcgcctggccgaggccatgaagaggtacggcctgaacaccaacc 
accgcatcgtggtgtgctctgagaactctctgcagttcttcatgccagtgctgggcgccctgttcatcggagtggccg 
tggcccctgctaacgacatttacaacgagcgcgagctgctgaacagcatgggcatttctcagcctaccgtggtgttc 
gtgtctaagaagggcctgcagaagatcctgaacgtgcagaagaagctgcctatcatccagaagatcatcatcatgg 

35 actctaagaccgactaccagggcttccagagcatgtacacattcgtgacatctcatctgcctcctggcttcaacgagt 
acgacttcgtgccagagtctttcgacagggacaaaaccattgccctgatcatgaacagctctgggtctaccggcctg 
cctaagggcgtggccctgcctcatcgcaccgcctgtgtgcgcttctctcacgcccgcgaccctattttcggcaacca 
gatcatccccgacaccgctattctgagcgtggtgccattccaccacggcttcggcatgttcaccaccctgggctacc 
tgatttgcggctttcgggtggtgctgatgtaccgcttcgaggaggagctgttcctgcgcagcctgcaagactacaaa 

40 attcagtctgccctgctggtgccaaccctgttcagcttcttcgctaagagcaccctgatcgacaagtacgacctgtcta 
acctgcacgagattgcctctggcggcgccccactgtctaaggaggtgggcgaagccgtggccaagcgctttcatc 
tgccaggcatccgccagggctacggcctgaccgagacaaccagcgccattctgattaccccagagggcgacga 
caagcctggcgccgtgggcaaggtggtgccattcttcgaggccaaggtggtggacctggacaccggcaagacc 
ctgggagtgaaccagcgcggcgagctgtgtgtgcgcggccctatgattatgtccggctacgtgaataaccctgag 

45 gccacaaacgccctgatcgacaaggacggctggctgcactctggcgacattgcctactgggacgaggacgagca 
cttcttcatcgtggaccgcctgaagtctctgatcaagtacaagggctaccaggtggccccagccgagctggagtct 
atcctgctgcagcaccctaacattttcgacgccggagtggccggcctgcccgacgacgatgccggcgagctgcct 
gccgccgtcgtcgtgctggaacacggcaagaccatgaccgagaaggagatcgtggactatgtggccagccaggt 
gacaaccgccaagaagctgcgcggcggagtggtgttcgtggacgaggtgcccaagggcctgaccggcaagct 

50 ggacgcccgcaagatccgcgagatcctgatcaaggctaagaaaggcggcaagatcgccgtgaattctCACG 
GCTTCCaCCCGAGGTGGAGGAGCAGGCCGCCGGCACCCTGCCCATG 
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AGCTGCGCCCAGGAGAGCGGCATGGAtaGaCACCCtGCtGCtTGCGCCA 
GCGCCAGGATCAACGTcTAA (SEQ ID NO: 71). 

An optimized firefly luciferase (hluc+)-optimized CLl -optimized PEST 
5 sequence (hluc+-hCLl -hPEST) has the following sequence: 

atggccgatgctaagaacattaagaagggccctgctcccttctaccctctggaggatggcaccgctggcgagcag 
ctgcacaaggccatgaagaggtatgccctggtgcctggcaccattgccttcaccgatgcccacattgaggtggaca 
tcacctatgccgagtacttcgagatgtctgtgcgcctggccgaggccatgaagaggtacggcctgaacaccaacc 

10 accgcatcgtggtgtgctctgagaactctctgcagttcttcatgccagtgctgggcgccctgttcatcggagtggccg 
tggcccctgctaacgacatttacaacgagcgcgagctgctgaacagcatgggcatttctcagcctaccgtggtgttc 
gtgtctaagaagggcctgcagaagatcctgaacgtgcagaagaagctgcctatcatccagaagatcatcatcatgg 
actctaagaccgactaccagggcttccagagcatgtacacattcgtgacatctcatctgcctcctggcttcaacgagt 
acgacttcgtgccagagtctttcgacagggacaaaaccattgccctgatcatgaacagctctgggtctaccggcctg 

15 cctaagggcgtggccctgcctcatcgcaccgcctgtgtgcgcttctctcacgcccgcgaccctattttcggcaacca 
gatcatccccgacaccgctattctgagcgtggtgccattccaccacggcttcggcatgttcaccaccctgggctacc 
tgatttgcggctttcgggtggtgctgatgtaccgcttcgaggaggagctgttcctgcgcagcctgcaagactacaaa 
attcagtctgccctgctggtgccaaccctgttcagcttcttcgctaagagcaccctgatcgacaagtacgacctgtcta 
acctgcacgagattgcctctggcggcgccccactgtctaaggaggtgggcgaagccgtggccaagcgctttcatc 

20 tgccaggcatccgccagggctacggcctgaccgagacaaccagcgccattctgattaccccagagggcgacga 
caagcctggcgccgtgggcaaggtggtgccattcttcgaggccaaggtggtggacctggacaccggcaagacc 
ctgggagtgaaccagcgcggcgagctgtgtgtgcgcggccctatgattatgtccggctacgtgaataaccctgag 
gccacaaacgccctgatcgacaaggacggctggctgcactctggcgacattgcctactgggacgaggacgagca 
cttcttcatcgtggaccgcctgaagtctctgatcaagtacaagggctaccaggtggccccagccgagctggagtct 

25 atcctgctgcagcaccctaacattttcgacgccggagtggccggcctgcccgacgacgatgccggcgagctgcct 
gccgccgtcgtcgtgctggaacacggcaagaccatgaccgagaaggagatcgtggactatgtggccagccaggt 
gacaaccgccaagaagctgcgcggcggagtggtgttcgtggacgaggtgcccaagggcctgaccggcaagct 
ggacgcccgcaagatccgcgagatcctgatcaaggctaagaaaggcggcaagatcgccgtgaattctGCtTG 
CAAGAACTGGTTCAGtAGCtTaAGCCACTTtGTGATCCACCTtAACAGCC 

30 ACGGCTTCCCtCCCGAGGTGGAGGAGCAGGCCGCCGGCACCCTGCCC 
ATGAGCTGCGCCCAGGAGAGCGGCATGGAtaGaCACCCtGCtGQTGCGC 
CAGCGCCAGGATCAACGTcTAg (SEQ ID NO: 11), 

An optimized Renilla luciferase-optimized PEST sequence (hRenilla-hPEST) 
35 has the following sequence: 

atggcttccaaggtgtacgaccccgagcaacgcaaacgcatgatcactgggcctcagtggtgggctcgctgcaag 
caaatgaacgtgctggactccttcatcaactactatgattccgagaagcacgccgagaacgccgtgatttttctgcat 
ggtaacgctgcctccagctacctgtggaggcacgtcgtgcctcacatcgagcccgtggctagatgcatcatccctg 

40 atctgatcggaatgggtaagtccggcaagagcgggaatggctcatatcgcctcctggatcactacaagtacctcac 
cgcttggttcgagctgctgaaccttccaaagaaaatcatctttgtgggccacgactggggggcttgtctggcctttca 
ctactcctacgagcaccaagacaagatcaaggccatcgtccatgctgagagtgtcgtggacgtgatcgagtcctgg 
gacgagtggcctgacatcgaggaggatatcgccctgatcaagagcgaagagggcgagaaaatggtgcttgaga 
ataacttcttcgtcgagaccatgctcccaagcaagatcatgcggaaactggagcctgaggagttcgctgcctacctg 

45 gagccattcaaggagaagggcgaggttagacggcctaccctctcctggcctcgcgagatccctctcgttaaggga 
ggcaagcccgacgtcgtccagattgtccgcaactacaacgcctaccttcgggccagcgacgatctgcctaagatgt 
tcatcgagtccgaccctgggttcttttccaacgctattgtcgagggagctaagaagttccctaacaccgagttcgtga 
aggtgaagggcctccacttcagccaggaggacgctccagatgaaatgggtaagtacatcaagagcttcgtggagc 
gcgtgctgaagaacgagcagaattctCACGGCTTCCCtCCCGAGGTGGAGGAGCAGG 

50 CCGCCGGCACCCTGCCCATGAGCTGCGCCCAGGAGAGCGGCATGGAta 
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GaCACCCtGCtGCtTGCGCCAGCGCCAGGATCAACGTcTAA (SEO ID NO- 
73). 

An optimized Renilla luciferase-optimized CLl -optimized PEST sequence 
5 (hReniUa-hGLl -hPEST) has the foUowing sequence: 

atggcttccaaggtgtacgaccccgagcaacgcaaacgcatgatcactgggcctcagtggtgggctcgctgcaag 
caaatgaacgtgctggactccttcatcaactactatgattccgagaagcacgccgagaacgccgtgatttttctgcat 
ggtaacgctgcctccagctacctgtggaggcacgtcgtgcctcacatcgagcccgtggctagatgcatcatccctg 
atctgatcggaatgggtaagtccggcaagagcgggaatggctcatatcgcctcctggatcactacaagtacctcac 
cgcttggttcgagctgctgaaccttccaaagaaaatcatctttgtgggccacgactggggggcttgtctggcctttca 
ctactcctacgagcaccaagacaagatcaaggccatcgtccatgctgagagtgtcgtggacgtgatcgagtcctgg 
gacgagtggcctgacatcgaggaggatatcgccctgatcaagagcgaagagggcgagaaaatggtgcttgaga 
ataacttcttcgtcgagaccatgctcccaagcaagatcatgcggaaactggagcctgaggagttcgctgcctacctg 
gagccattcaaggagaagggcgaggttagacggcctaccctctcctggcctcgcgagatccctctcgttaaggga 
ggcaagcccgacgtcgtccagattgtccgcaactacaacgcctaccttcgggccagcgacgatctgcctaagatgt 
tcatcgagtccgaccctgggttcttttccaacgctattgtcgagggagctaagaagttccctaacaccgagttcgtga 
aggtgaagggcctccacttcagccaggaggacgctccagatgaaatgggtaagtacatcaagagcttcgtggagc 
gcgtgctgaagaacgagcagaattctGCtTGCAAGAACTGGTTCAGtAGCtTaAGCCAC 
TTtGTGATCCACCTtAACAGCCACGGCTTCCCtCCCGAGGTGGAGGAGC 
AGGCCGCCGGCACCCTGCCCATGAGCTGCGCCCAGGAGAGCGGCATG 
GAtaGaCACCCtGCtGCtTGCGCCAGCGCCAGGATCAACGTcTAg (SEQ ID 

An optimized Renilla luciferase-optimized CLl-optimized PEST-UTR sequence 
(hRluc-hCLl-hPEST-UTR) has the following sequence: 

ATGGCTTCCAAGGTGTACGACCCCGAGCAACGCAAACGCATGATCAC 
TGGGCCTCAGTGGTGGGCTCGCTGCAAGCAAATGAACGTGCTGGACT 
CCTTCATCAACTACTATGATTCCGAGAAGCACGCCGAGAACGCCGTG 
ATTTTTCTGCATGGTAACGCTGCCTCCAGCTACCTGTGGAGGCACGTC 
GTGCCTCACATCGAGCCCGTGGCTAGATGCATCATCCCTGATCTGATC 
GGAATGGGTAAGTCCGGCAAGAGCGGGAATGGCTCATATCGCCTCCT 
GGATCACTACAAGTACCTCACCGCTTGGTTCGAGCTGCTGAACCTTCC 
AAAGAAAATCATCTTTGTGGGCCACGACTGGGGGGCTTGTCTGGCCT 
TTCACTACTCCTACGAGCACCAAGACAAGATCAAGGCCATCGTCCAT 
GCTGAGAGTGTCGTGGACGTGATCGAGTCCTGGGACGAGTGGCCTGA 
CATCGAGGAGGATATCGCCCTGATCAAGAGCGAAGAGGGCGAGAAA 
ATGGTGCTTGAGAATAACTTCTTCGTCGAGACCATGCTCCCAAGCAA 
GATCATGCGGAAACTGGAGCCTGAGGAGTTeGCTGCCTACCTGGAGC 
CATTCAAGGAGAAGGGCGAGGTTAGACGGCCTACCCTCTCCTGGCCT 
CGCGAGATCCCTCTCGTTAAGGGAGGCAAGCCCGACGTCGTCCAGAT 
TGTCCGCAACTACAACGCCTACCTTCGGGCCAGCGACGATCTGCCTA 
AGATGTTCATCGAGTCCGACCCTGGGTTCTTTTCCAACGCTATTGTCG 
AGGGAGCTAAGAAGTTCCCTAACACCGAGTTCGTGAAGGTGAAGGG 
CCTCCACTTCAGCCAGGAGGACGCTCCAGATGAAATGGGTAAGTACA 
TCAAGAGCTTCGTGGAGCGCGTGCTGAAGAACGAGCAGAATTCTGCT 
TGCAAGAACTGGTTCAGTAGCTTAAGCCACTTTGTGATCCACCTTAAC 
AGCCACGGCTTCCCTCCCGAGGTGGAGGAGCAGGCCGCCGGCACCCT 
GCCCATGAGCTGCGCCCAGGAGAGCGGCATGGATAGACACCCTGCTG 
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CTTGCGCCAGCGCCAGGATCAACGTCTAGGGCGCGGACTTTATTTATT 
TATTTCTT (SEQ ID NO:75) 

An optimized firefly luciferase-optimized CLl -optimized PEST-UTR sequence 
5 (hluc+-hCLl-hPEST-UTR) has the following sequence: 

ATGGCCGATGCTAAGAACATTAAGAAGGGCCCTGCTCCCTTCTACCC 
TCTGGAGGATGGCACCGCTGGCGAGCAGCTGCACAAGGCCATGAAG 
AGGTATGCCCTGGTGCCTGGCACCATTGCCTTCACCGATGCCCACATT 
GAGGTGGACATCACCTATGCCGAGTACTTCGAGATGTCTGTGCGCCT 
GGCCGAGGCCATGAAGAGGTACGGCCTGAACACCAACCACCGCATC 
GTGGTGTGCTCTGAGAACTCTCTGCAGTTCTTCATGCCAGTGCTGGGC 
GCCCTGTTCATCGGAGTGGCCGTGGCCCCTGCTAACGACATTTACAA 
CGAGCGCGAGCTGCTGAACAGCATGGGCATTTCTCAGCCTACCGTGG 
TGTTCGTGTCTAAGAAGGGCCTGCAGAAGATCCTGAACGTGCAGAAG 
AAGCTGCCTATCATCCAGAAGATCATCATCATGGACTCTAAGACCGA 
CTACCAGGGCTTCCAGAGCATGTACACATTCGTGACATCTCATCTGCC 
TCCTGGCTTCAACGAGTACGACTTCGTGCCAGAGTCTTTCGACAGGG 
ACAAAACCATTGCCCTGATCATGAACAGCTCTGGGTCTACCGGCCTG 
CCTAAGGGCGTGGCCCTGCCTCATCGCACCGCCTGTGTGCGCTTCTCT 
CACGCCCGCGACCCTATTTTCGGCAACCAGATCATCCCCGACACCGC 
TATTCTGAGCGTGGTGCCATTCCACCACGGCTTCGGCATGTTCACCAC 
CCTGGGCTACCTGATTTGCGGCTTTCGGGTGGTGCTGATGTACCGCTT 
CGAGGAGGAGCTGTTCCTGCGCAGCCTGCAAGACTACAAAATTCAGT 
CTGCCCTGCTGGTGCCAACCCTGTTCAGCTTCTTCGCTAAGAGCACCC 
TGATCGACAAGTACGACCTGTCTAACCTGCACGAGATTGCCTCTGGC 
GGCGCCCCACTGTCTAAGGAGGTGGGCGAAGCCGTGGCCAAGCGCTT 
TCATCTGCCAGGCATCCGCCAGGGCTACGGCCTGACCGAGACAACCA 
GCGCCATTCTGATTACCCCAGAGGGCGACGACAAGCCTGGCGCCGTG 
GGCAAGGTGGTGCCATTCTTCGAGGCCAAGGTGGTGGACCTGGACAC 
CGGCAAGACCCTGGGAGTGAACCAGCGCGGCGAGCTGTGTGTGCGC 
GGCCCTATGATTATGTCCGGCTACGTGAATAACCCTGAGGCCACAAA 
CGCCCTGATCGACAAGGACGGCTGGCTGCACTCTGGCGACATTGCCT 
ACTGGGACGAGGACGAGCACTTCTTCATCGTGGACCGCCTGAAGTCT 
CTGATCAAGTACAAGGGCTACCAGGTGGCCCCAGCCGAGCTGGAGTC 
TATCCTGCTGCAGCACCCTAACATTTTCGACGCCGGAGTGGCCGGCCT 
GCCCGACGACGATGCCGGCGAGCTGCCTGCCGCCGTCGTCGTGCTGG 
AACACGGCAAGACCATGACCGAGAAGGAGATCGTGGACTATGTGGC 
CAGCCAGGTGACAACCGCCAAGAAGCTGCGCGGCGGAGTGGTGTTC 
GTGGACGAGGTGCCCAAGGGCCTGACCGGCAAGCTGGACGCCCGCA 
AGATCCGCGAGATCCTGATCAAGGCTAAGAAAGGCGGCAAGATCGC 
CGTGAATTCTGCTTGCAAGAACTGGTTCAGTAGCTTAAGCCACTTTGT 
GATCCACCTTAACAGCCACGGCTTCCCTCCCGAGGTGGAGGAGCAGG 
CCGCCGGCACCCTGCCCATGAGCTGCGCCCAGGAGAGCGGCATGGAT 
AGACACCCTGCTGCTTGCGCCAGCGCCAGGATCAACGTCTAGGGCGC 
GGACTTTATTTATTTATTTCTT (SEQ ID NO:76) 

Optimized click beetle sequences include: 

CBRluc-hPEST 
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ATGGTAAAGCGTGAGAAAAATGTCATCTATGGCCCTGAGCCTCTCCA 
TCCTTTGGAGGATTTGACTGCCGGCGAAATGCTGTTTCGTGCTCTCCG 
CAAGCACTCTCATTTGCCTCAAGCCTTGGTCGATGTGGTCGGCGATGA 
ATCTTTGAGCTACAAGGAGTTTTTTGAGGCAACCGTCTTGCTGGCTCA 
5 GTCCCTCCACAATTGTGGCTACAAGATGAACGACGTCGTTAGTATCT 
GTGCTGAAAACAATACCCGTTTCTTCATTCCAGTCATCGCCGCATGGT 
ATATCGGTATGATCGTGGCTCCAGTCAACGAGAGCTACATTCCCGAC 
GAACTGTGTAAAGTCATGGGTATCTCTAAGCCACAGATTGTCTTCACC 
ACTAAGAATATTCTGAACAAAGTCCTGGAAGTCCAAAGCCGCACCAA 

10 CTTTATTAAGCGTATCATCATCTTGGACACTGTGGAGAATATTCACGG 
TTGCGAATCTTTGCCTAATTTCATCTCTCGCTATTCAGACGGCAACAT 
CGCAAACTTTAAACCACTCCACTTCGACCCTGTGGAACAAGTTGCAG 
CCATTCTGTGTAGCAGCGGTACTACTGGACTCCCAAAGGGAGTCATG 
CAGACCCATCAAAACATTTGCGTGCGTCTGATCCATGCTCTCGATCCA 

1 5 CGCTACGGCACTC AGCTGATTCCTGGTGTCACCGTCTTGGTCTACTTG 
CCTTTCTTCCATGCTTTCGGCTTTCATATTACTTTGGGTTACTTTATGG 
TCGGTCTCCGCGTGATTATGTTCCGCCGTTTTGATCAGGAGGCTTTCT 
TGAAAGCCATCCAAGATTATGAAGTCCGCAGTGTCATCAACGTGCCT 
AGCGTGATCCTGTTTTTGTCTAAGAGCCCACTCGTGGACAAGTACGA 

20 CTTGTCTTCACTGCGTGAATTGTGTTGCGGTGCCGCTCCACTGGCTAA 
GGAGGTCGCTGAAGTGGCCGCCAAACGCTTGAATCTTCCAGGGATTC 
GTTGTGGCTTCGGCCTCACCGAATCTACCAGTGCGATTATCCAGACTC 
TCGGGGATGAGTTTAAGAGCGGCTCTTTGGGCCGTGTCACTCCACTC 
ATGGCTGCTAAGATCGCTGATCGCGAAACTGGTAAGGCTTTGGGCCC 

25 GAACCAAGTGGGCGAGCTGTGTATCAAAGGCCCTATGGTGAGCAAG 
GGTTATGTCAATAACGTTGAAGCTACCAAGGAGGCCATCGACGACGA 
CGGCTGGTTGCATTCTGGTGATTTTGGATATTACGACGAAGATGAGC 
ATTTTTACGTCGTGGATCGTTACAAGGAGCTGATCAAATACAAGGGT 
AGCCAGGTTGCTCCAGCTGAGTTGGAGGAGATTCTGTTGAAAAATCC 

30 ATGCATTCGCGATGTCGCTGTGGTCGGCATTCCTGATCTGGAGGCCG 
GCGAACTGCCTTCTGCTTTCGTTGTCAAGCAGCCTGGTACAGAAATTA 
CCGCCAAAGAAGTGTATGATTACCTGGCTGAACGTGTGAGCCATACT 
AAGTACTTGCGTGGCGGCGTGCGTTTTGTTGACTCCATCCCTCGTAAC 
GTAACAGGCAAAATTACCCGCAAGGAGCTGTTGAAACAATTGTTGGT 

35 GAAGGCCGGCGGGAATTCTCACGGCTTCCCTCCCGAGGTGGAGGAGC 
AGGCCGCCGGCACCCTGCCCATGAGCTGCGCCCAGGAGAGCGGCATG 
GATAGACACCCTGCTGCTTGCGCCAGCGCCAGGATCAACGTCTAA 
(SEQIDNO:77) 

40 CBRluc-hCLl-hPEST-UTR 

ATGGTAAAGCGTGAGAAAAATGTCATCTATGGCCCTGAGCCTCTCCA 
TCCTTTGGAGGATTTGACTGCCGGCGAAA.TGCTGTTTCGTGCTCTCCG 
CAAGCACTCTCATTTGCCTCAAGCCTTGGTCGATGTGGTCGGCGATGA 

45 ATCTTTGAGCTACAAGGAGTTTTTTGAGGCAACCGTCTTGCTGGCTCA 
GTCCCTCCACAATTGTGGCTACAAGATGAACGACGTCGTTAGTATCT 
GTGCTGAAAACAATACCCGTTTCTTCATTCCAGTCATCGCCGCATGGT 
ATATCGGTATGATCGTGGCTCCAGTCAACGAGAGCTACATTCCCGAC 
GAACTGTGTAAAGTCATGGGTATCTCTAAGCCACAGATTGTCTTCACC 

50 ACTAAGAATATTCTGAACAAAGTCCTGGAAGTCCAAAGCCGCACCAA 
CTTTATTAAGCGTATCATCATCTTGGACACTGTGGAGAATATTCACGG 
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TTGCGAATCTTTGCCTAATTTCATCTCTCGCTATTCAGACGGCAACAT 
CGCAAACTTTAAACCACTCCACTTCGACCCTGTGGAACAAGTTGCAG 
CCATTCTGTGTAGCAGCGGTACTACTGGACTCCCAAAGGGAGTCATG 
CAGACCCATCAAAACATTTGCGTGCGTCTGATCCATGCTCTCGATCCA 
5 CGCTACGGCACTCAGCTGATTCCTGGTGTCACCGTCTTGGTCTACTTG 
CCTTTCTTCCATGCTTTCGGCTTTCATATTACnTTGGGTTACTTTATGG 
TCGGTCTCCGCGTGATTATGTTCCGCCGTTTTGATCAGGAGGCTTTCT 
TGAAAGCCATCCAAGATTATGAAGTCCGCAGTGTCATCAACGTGCCT 
AGCGTGATCCTGTTTTTGTCTAAGAGCCCACTCGTGGACAAGTACGA 

10 CTTGTCTTCACTGCGTGAATTGTGTTGCGGTGCCGCTCCACTGGCTAA 
GGAGGTCGCTGAAGTGGCCGCCAAACGCTTGAATCTTCCAGGGATTC 
GTTGTGGCTTCGGCCTCACCGAATCTACCAGTGCGATTATCCAGACTC 
TCGGGGATGAGTTTAAGAGCGGCTCTTTGGGCCGTGTCACTCCACTC 
ATGGCTGCTAAGATCGCTGATCGCGAAACTGGTAAGGCTTTGGGCCC 

15 GAACCAAGTGGGCGAGCTGTGTATCAAAGGCCCTATGGTGAGCAAG 
GGTTATGTCAATAACGTTGAAGCTACCAAGGAGGCCATCGACGACGA 
CGGCTGGTTGCATTCTGGTGATTTTGGATATTACGACGAAGATGAGC 
ATTTTTACGTCGTGGATCGTTACAAGGAGCTGATCAAATACAAGGGT 
AGCCAGGTTGCTCCAGCTGAGTTGGAGGAGATTCTGTTGAAAAATCC 

20 ATGCATTCGCGATGTCGCTGTGGTCGGCATTCCTGATCTGGAGGCCG 
GCGAACTGCCTTCTGCTTTCGTTGTCAAGCAGCCTGGTACAGAAATTA 
CCGCCAAAGAAGTGTATGATTACCTGGCTGAACGTGTGAGCCATACT 
AAGTACTTGCGTGGCGGCGTGCGTTTTGTTGACTCCATCCCTCGTAAC 
GTAACAGGCAAAATTACCCGCAAGGAGCTGTTGAAACAATTGTTGGT 

25 GAAGGCCGGCGGGAATTCTGCTTGCAAGAACTGGTTCAGTAGCTTAA 
GCCACTTTGTGATCCACCTTAACAGCCACGGCTTCCCTCCCGAGGTGG 
AGGAGCAGGCCGCCGGCACCCTGCCCATGAGCTGCGCCCAGGAGAG 
CGGCATGGATAGACACCCTGCTGCTTGCGCCAGCGCCAGGATCAACG 
TCTAGGGCGCGGACTTTATTTATTTATTTCTT (SEQIDNO:78) 

30 

CBG991uc-hPEST 

ATGGTGAAGCGTGAGAAAAATGTCATCTATGGCCCTGAGCCTCTCCA 
TCCTTTGGAGGATTTGACTGCCGGCGAAATGCTGTTTCGTGCTCTCCG 

35 CAAGCACTCTCATTTGCCTCAAGCCTTGGTCGATGTGGTCGGCGATGA 
ATCTTTGAGCTACAAGGAGTTTTTTGAGGCAACCGTCTTGCTGGCTCA 
GTCCCTCCACAATTGTGGCTACAAGATGAACGACGTCGTTAGTATCT 
GTGCTGAAAACAATACCCGTTTCTTCATTCCAGTCATCGCCGCATGGT 
ATATCGGTATGATCGTGGCTCCAGTCAACGAGAGCTACATTCCCGAC 

40 GAACTGTGTAAAGTCATGGGTATCTCTAAGCCACAGATTGTCTTCACC 
ACTAAGAATATTCTGAACAAAGTCCTGGAAGTCCAAAGCCGCACCAA 
CTTTATTAAGCGTATCATCATCTTGGACACTGTGGAGAATATTCACGG 
TTGCGAATCTTTGCCTAATTTCATCTCTCGCTATTCAGACGGCAACAT 
CGCAAACTTTAAACCACTCCACTTCGACCCTGTGGAACAAGTTGCAG 

45 CCATTCTGTGTAGCAGCGGTACTACTGGACTCCCAAAGGGAGTCATG 
CAGACCCATCAAAACATTTGCGTGCGTCTGATCCATGCTCTCGATCCA 
CGCGTGGGCACTCAGCTGATTCCTGGTGTCACCGTCTTGGTCTACTTG 
CCTTTCITCCATGCTTTCGGCTTTAGCATTACTTTGGGTTACTTTATGG 
TCGGTCTCCGCGTGATTATGTTCCGCCGTTTTGATCAGGAGGCTTTCT 

50 TGAAAGCCATCCAAGATTATGAAGTCCGCAGTGTCATCAACGTGCCT 
AGCGTGATCCTGTTTTTGTCTAAGAGCCCACTCGTGGACAAGTACGA 
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CTTGTCTTCACTGCGTGAATTGTGTTGCGGTGCCGCTCCACTGGCTAA 
GGAGGTCGCTGAAGTGGCCGCCAAACGCTTGAATCTTCCAGGGATTC 
GTTGTGGCTTCGGCCTCACCGAATCTACCAGCGCTAACATTCACTCTC 
TCGGGGATGAGTTTAAGAGCGGCTCTTTGGGCCGTGTCACTCCACTC 
5 ATGGCTGCTAAGATCGCTGATCGCGAAACTGGTAAGGCTTTGGGCCC 
GAACCAAGTGGGCGAGCTGTGTATCAAAGGCCCTATGGTGAGCAAG 
GGTTATGTCAATAACGTTGAAGCTACCAAGGAGGCCATCGACGACGA 
CGGCTGGTTGCATTCTGGTGATTTTGGATATTACGACGAAGATGAGC 
ATTTTTACGTCGTGGATCGTTACAAGGAGCTGATCAAATACAAGGGT 

10 AGCCAGGTTGCTCCAGCTGAGTTGGAGGAGATTCTGTTGAAAAATCC 
ATGCATTCGCGATGTCGCTGTGGTCGGCATTCCTGATCTGGAGGCCG 
GCGAACTGCCTTCTGCTTTCGTTGTCAAGCAGCCTGGTAAAGAAATTA 
CCGCCAAAGAAGTGTATGATTACCTGGCTGAACGTGTGAGCCATACT 
AAGTACTTGCGTGGCGGCGTGCGTTTTGTTGACTCCATCCCTCGTAAC 

1 5 GTAACAGGCAAAATTACCCGCAAGGAGCTGTTGAAACAATTGTTGGA 
GAAGGCCGGCGGGAATTCTCACGGCTTCCCTCCCGAGGTGGAGGAGC 
AGGCCGCCGGCACCCTGCCCATGAGCTGCGCCCAGGAGAGCGGCATG 
GATAGACACCCTGCTGCTTGCGCCAGCGCCAGGATCAACGTCTAA 
(SEQ ID NO:79) 

20 

CBG991uc-hCLl-liPEST-UTR 
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ATGGTGAAGCGTGAGAAAAATGTCATCTATGGCCCTGAGCCTCTCCA 
TCCTTTGGAGGATTTGACTGCCGGCGAAATGCTGTTTCGTGCTCTCCG 
CAAGCACTCTCATTTGCCTCAAGCCTTGGTCGATGTGGTCGGCGATGA 
ATCTTTGAGCTACAAGGAGTTTTTTGAGGCAACCGTCTTGCTGGCTCA 
5 GTCCCTCCACAATTGTGGCTACAAGATGAACGACGTCGTTAGTATCT 
GTGCTGAAAACAATACCCGTTTCTTCATTCCAGTCATCGCCGCATGGT 
ATATCGGTATGATCGTGGCTCCAGTCAACGAGAGCTACATTCCCGAC 
GAACTGTGTAAAGTCATGGGTATCTCTAAGCCACAGATTGTCTTCACC 
ACTAAGAATATTCTGAACAAAGTCCTGGAAGTCCAAAGCCGCACCAA 

10 CTTTATTAAGCGTATCATCATCTTGGACACTGTGGAGAATATTCACGG 
TTGCGAATCTTTGCCTAATTTCATCTCTCGCTATTCAGACGGCAACAT 
CGCAAACTTTAAACCACTCCACTTCGACCCTGTGGAACAAGTTGCAG 
CCATTCTGTGTAGCAGCGGTACTACTGGACTCCCAAAGGGAGTCATG 
CAGACCCATCAAAACATTTGCGTGCGTCTGATCCATGCTCTCGATCCA 

15 CGCGTGGGCACTCAGCTGATTCCTGGTGTCACCGTCTTGGTCTACTTG 
CCTTT(m'CCATGCTTTCGGCTTTAGCATTACTTTGGGTTACTTTATGG 
TCGGTCTCCGCGTGATTATGTTCCGCCGTTTTGATCAGGAGGCTTTCT 
TGAAAGCCATCCAAGATTATGAAGTCCGCAGTGTCATCAACGTGCCT 
AGCGTGATCCTGTTTTTGTCTAAGAGCCCACTCGTGGACAAGTACGA 

20 CTTGTCTTCACTGCGTGAATTGTGTTGCGGTGCCGCTCCACTGGCTAA 
GGAGGTCGCTGAAGTGGCCGCCAAACGCTTGAATCTTCCAGGGATTC 
GTTGTGGCTTCGGCCTCACCGAATCTACCAGCGCTAACATTCACTCTC 
TCGGGGATGAGTTTAAGAGCGGCTCTTTGGGCCGTGTCACTCCACTC 
ATGGCTGCTAAGATCGCTGATCGCGAAACTGGTAAGGCTTTGGGCCC 

25 GAACCAAGTGGGCGAGCTGTGTATCAAAGGCCCTATGGTGAGCAAG 
GGTTATGTCAATAACGTTGAAGCTACCAAGGAGGCCATCGACGACGA 
CGGCTGGTTGCATTCTGGTGATTTTGGATATTACGACGAAGATGAGC 
ATTTTTACGTCGTGGATCGTTACAAGGAGCTGATCAAATACAAGGGT 
AGCCAGGTTGCTCCAGCTGAGTTGGAGGAGATTCTGTTGAAAAATCC 

30 ATGCATTCGCGATGTCGCTGTGGTCGGCATTCCTGATCTGGAGGCCG 
GCGAACTGCCTTCTGCTTTCGTTGTCAAGCAGCCTGGTAAAGAAATTA 
CCGCCAAAGAAGTGTATGATTACCTGGCTGAACGTGTGAGCCATACT 
AAGTACTTGCGTGGCGGCGTGCGTTTTGTTGACTCCATCCCTCGTAAC 
GTAACAGGCAAAATTACCCGCAAGGAGCTGTTGAAACAATTGTTGGA 

35 GAAGGCCGGCGGGAATTCTGCTTGCAAGAACTGGTTCAGTAGCTTAA 
GCCACTTTGTGATCCACCTTAACAGCCACGGCTTCCCTCCCGAGGTGG 
AGGAGCAGGCCGCCGGCACCCTGCCCATGAGCTGCGCCCAGGAGAG 
CGGCATGGATAGACACCCTGCTGCTTGCGCCAGCGCCAGGATCAACG 
TCTAGGGCGCGGACTTTATTTATTTATTTCTT (SEQ ID NO:80) 

40 
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Table 1 



Name of the 
primer 


Sequence of the primer 


LucN 


5 - AGATCTGCGATCTAAGTAAGCTTCjCj; bbQ ID JNU.lo 


LucC 


5'- ACTCTAGAATTCACGGCGATCTTTCC; SEQ IDNO: 17 


HRLN 


5'-GGCGAAGCTTGGGTCACCTCCAAGGTGTACGACCCCGAGC; 
SEQ ID NO: 18 


HRLC 


5'- GCrcrAGAATGAATTCTGCTCGTTCTTCAGCACGCGCT; SEQ 
IDN0:19 


PEST-5' 


5'-AATTCTCATGGCTTCCCGCCGGAGATGGAGGAGCAGGCT 
GCTGGCACGCTGCCCATGTCTT; SEQ ID N0:8 


PEST-3' 


5'-GTGCCCAGGAGAGCGGGATGGACCGTCACCCTGCAGCCT 
GTGCTTCTGCTAGGATCAATGTGTAA; SEQ ID N0:9 


5*-PEST 


5'-GGCCTTACACATTGATCCTAGCAGAAGCACAGGCTGCAG 
GGTGACGGTCCATCCCGCTCTCCT; SEQ ID NO: 10 


3»-PEST 


5'-GGGCACAAGACATGGGCAGCGTGCCAGCAGCCTGCTCCT 
CCATCTCCGGCGGGAAGCCATGAG; SEQ ID NO: 1 1 


Ubiquitin 
5*wt/BsytEn 


5'- TAGCATGGTCACCCAGATTTTCGTGAAAACCCTTACG; SEQ 
ID NO:20 


Ubiquitin 
3'w/BstEn 


5'-ATGCTAGGTGACCGGATCCCGCGGATAACCACCA; SEQ ID 
NO:21 




5'-CCATGGGACATCATCACCATCACCACGGGGATCCACAAG 
CTTATGAAGAAATTAGCAA; SEQ ID NO:22 


Ubi-Luc 3* 
w/ Linker 


5'-TTCTGGATCCCGCGGTATACCACCACGAAGACTCAACAC; 
SEQ ID NO:23 


Ubi-Luc 3* 
w/ Linker 
Met 


5'-TTCTGGATCCCGCGGCATACCACCACGAAGACTCAACAC; 
SEQ ID NO:24 


Ubi-Luc3* 
w/Linker Glu 


5'-TTCTGGATCCCGCGGCTCACCACCACGAAGACTCAACAC; 
SEQ ID NO:25 


Ubi-Luc 5* 
w/Linker 


5'-TATGGGCCCTTAATACGACTCACTATAGGGGAATTGTGA 
GCGGATAACAATTCCCCTCrAGAAATAA'ril'rGTTTAACTTTA 
AGAAGGAGATATAC CATGCAGATnTCGTGAAAACC; SEQ ID 
NO:26 


Ubiq(R) 


5'- TTTTGGCGTCGGTGACCGGATCCCGCGGTCGACCACCAC 
GAAG; SEQIDNO:27 


Ala 


5'-TTTTGGCGTCGGTGACCGGATCCCGCGGTGCACCACCAC 
GAAG; SEQIDNO:28 


Asn 


5'-TTTTGGCGTCGGTGACCGGATCCCGCGGGTTACCACCAC 
GAAG; SEQIDNO:29 


Asp 


5'-TTTTGGCGTCGGTGACCGGATCCCGCGGATCACCACCAC 
GAAG; SEQIDNO:30 


Phe 


5 -TTTTGGCGTCGGTGACCGGATCCCuCuuGAAACCACCAL 
GAAG; SEQIDN0:31 


His2 


S'-TTTTGGCGTCGGTGACCGGATCCCGCGGATGACCACCAC 
GAAG;SEQIDN0:32 


His 


S'-TTTTGGCGTCGGTGACCGGATCCCGCGGGTGACCACCAC 
GAAG; SEQIDNO:33 


Leu 


5'-TTTTGGCGTCGGTGACCGGATCCCGCGGGAGACCACCAC 
GAAG; SEQIDNO:34 



47 



wo 2004/025264 



PCTAJS2003/028939 



Lys 


5'-TTTTGGCGTCGGTGACCGGATCCCGCGGCTTACCACCACG 
AAG;SEQIDNO:35 


Gin 


5'-TTTTGGCGTCGGTGACCGGATCCCGCGGTTGACCACCAC 
GAAG; SEQ ID NO:36 


Trp 


5'-TTTTGGCGTCGGTGACCGGATCCCGCGGCCAACCACCAC 
GAAG; SEQ DD NO:37 


Ubiq(E)del5' 


5'- GTTTTTGGCGTCGGTGACCTCACCACCACGAAGACTC; SEQ 
IDNO:38 


Ubiq(E)del3' 


5'- GAGTCTTCGTGGTGGTGAGGTCACCGACGCCAAAAAC; SEQ 
IDNO:39 


Luc 5' 


5'-GTTCCAGGAACCAGGGCGTATCTC; SEQ ID NO:40 


Luc 3* 


5'-CGCGGAGGAGTTGTGTTTGTGGAC; SEQ ID N0:41 


Luc+N 


5'-GGCGAAGCTTGGGTCACCGATGCTAAGAACATTAAGAA 
GGG; SEQ ID NO:42 


Luc+C 


5 -GCTCTAGAATGAATTCACGGCGATCTTGCCGCC; SEQ ID 
JNU:4J 


tetu-j 




tetO-3* 


5'-AGCTAGCGAGGCTGGATCGGTCCCGGT; SEQ ID NO:51 


A T TTTTT 

AUUU 


5 - CTAGATTTATTTATTTATTTCTTCATATGC; SEQ ID NO:52 


Anti-AUUU 


5'- AATTGCATATGAAGAAATAAATAAATAAAT; SEQ ID NO:53 


hsp70-5* 


S'-ATTAATCTGATCAATAAAGGGTTTAAGG; SEQ ID NO:54 


hsp70-3' 


5'-AAAAAGGTAGTGGACTGTCG; SEQ ID NO:55 


3*-BKBl 


5'-AATTGGGAATTAAAACAGCATTGAACCAAGAAGCTTGGCT 
TTCTTATCAATTCTTTGTGACATAATAAGTT; SEQ ID NO:56 


5*-BKBlrev 


5'-AACTTATTATGTCACAAAGAATTGATAAGAAAGCCAAGCT 
TCTTGGTTCAATGCTG rnTAATTCCC; SEQ ID NO:57 


N-M 


5'-GATCTGCGGCCGCATATATG; SEQ IDNO:58 


M-C 


5'-GTQACCATATATGCGGCCGCA; SEQ ED NO:59 


CLl-RI-final 


5'-AATTTGTCTGCCTGCAAGAACTGGTTCAGCAGCTTGAGCCA 
CTTCGTGATCCACTTG; SEQ ID NO:60 


Rev-CLl-RI- 
final 


5'-AATTCAAGTGGATCACGAAGTGGCTCAAGCTGCTGAACCA 
GTTCTTGCAGGCAGACA; SEQ ID NO:61 


CLl 


5'-AATTCTGCCTGCAAGAACTGGTTCAGCAGCTTGAGCCACTT 
CGTGATCCACTTGTAAGC; SEQ ED NO:62 


Rev-CLl 


5'-GGCCGCTTACAAGTGGATCACGAAGTGGCTCAAGCTGCT 
GAACCAGTTCTTGCAGGCAG; SEQ ID NO:63 


CLl-M 


5 -GATCrTATGTCTGCCTGCAAGAACTGGTTCAGCAGCTTGAG 
CCACTTCGTGATCCACTTGCA; SEQ ED NO:64 


Rev-CLl-N 


5'-AGCTTGCAAGTGGATCACGAAGTGGCTCAAGCTGCTGAA 
CCAGTTCTTGCAGGCAGACATAA; SEQ ED NO:65 


AUUU 


S'-CTAGATTTATTTATTTATTTCTTCATATGC; SEQ ID NO :3 


Anti-AUUU 


5'-AATTGCATATGAAGAAATAAATAAATAAAT; SEQ ED NO :4 



DNA sequencing 

Plasmid DNA sequences were confirmed by DNA sequencing which was 
performed on ABI Prizm Model 377 using either Luc5' or Luc3* primers (Table 
5 1). 

Expression in vitro 
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Both TNT® SP6 Coupled Wheat Germ Extract System and TNT® T7 
Coupled Reticulocyte Lysate System (Promega) were used to express firefly 
luciferase and fusion proteins thereof in vitro. [^H]-Leucine was included in the 
reaction mixture. Upon completion, the reaction mixtures were separated into 
5 two portions. The first portion was used to determine luciferase activity as 
described in the section entitled "Luciferase assay conditions" and the second 
portion was used to determine the quantity of synthesized luciferase. Proteins 
contained in the second portion were separated by SDS gel electrophoresis using 
4-20% Tris-glycine gels (Novex). After completion of the electrophoresis, the 

10 location of the protein of interest on the gel was determined by autoradiography. 
Then bands containing proteins of interest were cut from the gel and the amounts 
of incorporated radioactivity determined by Uquid scintillation. The ratio 
between luminescence data and the amount of radioactivity was used to 
characterize specific activity. 

15 Mammalian cells 

Human adenocarcinoma cell line HeLa, Afincan green monkey kidney 
cell line COS-7, Chinese hamster ovary cell line CHO-Kl, and human 
embryonic kidney 293 cells were obtained from ATCC. All cell lines were 
maintained in RPMI-1640 medium containing 5% fetal bovine serum and a 

20 mixture of antibiotics (penicillin, 100 jUg/ml; streptomycin, 100 ^tg/ml; 
amphotericin B, 0.25 p,g/ml). 

Transient transfection and treatment with cvcloheximide 

For transfection, cells were grown to confluence in T25 flasks (Falkon, 
Becton Dickinson, Oxford). Transfection was conducted in 1 ml of serum-free 

25 RPMI-1640 that was mixed with 8 iig of plasmid DNA and 20 ^1 of 
Lipofectamine™2000 (GIBCO BRL). CHO cells were incubated in the 
transfection media for 30 minutes, HeLa cells for 1 hour, and COS-7 cells for 5 
hours. Following incubation, cells were trypsinized with Trypsin-EDTA 
(GIBCO BRL) and collected by centrifiigation. Cells collected from each T25 

30 flask were resuspended in 10 ml of RPMI-1640 medium containing 5% fetal 
bovine serum and a mixture of antibiotics, transferred to 96 well plates (100 
|il/well), and allowed to grow for 12-16 hours prior to treatment with different 
agents. Cycloheximide (Sigma) or doxycycline was added to different wells at 
different times (the final concentration was 100 fxg/ml and 2 ng/ml. 



wo 2004/025264 



PCTAJS2003/028939 



respectively). For thermoinduction, cells were transferred to 42°C for 1 hour 
and then were incubated at 37*^0 for different periods. After incubation, plates 
with cells expressing derivatives of firefly luciferase were transferred to 
70°C. In the case of cells expressing Renilla luciferase, the culture media was 
5 substituted with lysis buffer jfrom the Renilla Luciferase Assay System 
(Proraega). Luciferase assay conditions 

To determine firefly luciferase activity, cells were lysed by 
fireeze/thawing. Lysate fi-om each well (100 /il) was transferred into 
corresponding wells of opaque 96 well plates (Falkon, Becton Dickinson, 

10 Oxford) containing 100 |il of Bright-Glo™ Luciferase Assay System (Promega) 
and luminescence intensity was determined using MLX Microtiter Plate 
Limiinometer (Dynex). The activity of Renilla luciferase was measured by 
mixing 40 |il samples with 100 |il of assay reagent firom the Renilla Luciferase 
Assay System (Promega). 

15 Tet-Offsvstem 

A DNA firagment containing the CMV minimal promotor with 
heptamerized upstream tet-operators (Gossen et al., 1992) was amphfied fi-om 
plasmid pUHDlO-3 by PGR with primers: 

AGCTAGCGAGGCTGGATCGGTCCCGGT (SEQ ID NO:44) and 
20 GATTAATGGCCCTTTCGTCCTCGAGTT (SEQ ID NO:45). The amplified 
firagment was used to substitute the CMV promoter into Renilla luciferase 
encoding plasmids. HeLa cells were transfected with a mixture contaixiing a 
Renilla luciferase encoding plasmidand plasmid pUHD15-l in ratio of 4.5:1. 
Plasmid pUHD15-l encodes a hybrid transactivator that contains the tetracycline 
25 repressor and the C-terminal domain of VP 1 6 firom HS V which sthnulates 

minimal promoters fiised to tetracycline operator sequences. In the presence of 
doxycycline, activity of the hybrid transactivator is inhibited. 
CRE svstem 

D293 cells are an isolated subpopulation of 293 cells that produce a 
30 significant amount of cAMP upon induction. pGL-3 plasmids contain multiple 
CREs which respond to cAMP induction by increasing transcription. D293 cells 
were trypsin treated and 7.5 x 10^ cells were added to wells of a 96-well plate. 
After an overnight incubation, the media firom transfected cells was removed and 
replaced with media containing isoproterenol (Iso, Calbiochem #420355, final 
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concentration 1 [iM) and RO (Ro-20-1724, Calbiochem #557502, final 
concentration 100 fxM). Iso induces the c AMP pathway and RO prevents 
degradation of cAMP. The plates were returned to the incubator. Samples from 
time points t = 0, 3, 6, 9 and 12 hours post-Iso/RO were collected. At 6 or 24 
5 hours post-Iso/RO addition, prostaglandin El (PGEl, Calbiochem #538903, 
final concentration 1.0 /xM) was added to the cells. Samples from time points 
24, 27, 30, 33, and 36 hours post-Iso/RO addition (i.e., experimental start) were 
then collected. 

For click beetle experiments, D293 cells were transiently transfected with 
10 codon optimized red (CBR) or green (CBG) click beetle sequences in 

conjunction with destabiUzation sequences. Two days post-transfection, the cells 
were treated with trypsin and 7.5 x 10^ cells were added to wells of a 96-well 
plate. After an overnight incubation, the media from transfected cells was 
removed and replaced with media containing isoproterenol. Samples from time 
15 points t = 0, 1.5, 3, 4.5, 6, and 7.5 hours post-induction were collected. 

To determine the relative Ught units (RLU) for the samples, 20 |il of 
passive lysis buffer was added to each well, the Dual Luciferase Assay was 
performed, and the RLU were collected. In order to determine the fold- 
induction, the RLU from the drug treated wells were divided by RLU from non- 
20 treated wells. 

Stable Cell Lines 

D293 cells were transfected with plasmids and then grown in media 
containing 600 fig/m\ of G418. Individual Unes of stably transfected cells were 
generated by seeding individual cells from the population grown in the 
G418-containing media into wells of a 96-well plate and growing the seeded 
cells in the G418-containing media. 
Results 

Construction and analy s is of deubiqutination of ubiquitin-fireflv luciferase 
fiisions 

To create luciferase species that would have at their N-termini amino 
acid residues of choice, the approach described earher by Varshavsky and 
coauthors was used (Bachmair et al, 1986). This approach utilizes the ability of 
ubiquitin-specific processing proteases in cells to cleave fiision proteins 
containing ubiquitin at the N-terminus. Such cleavage occurs umnediately after 



51 



wo 2004/025264 



PCT/US2003/028939 



the last amino acid residue of ubiquitin. According to Bachmair et al (1986) 
such deubiquitination occurs irrespective of the identity of the residue located 
immediately after the cleavage site. 

PlasniidspUbiq(Y)Lucl9 andpSPUbiqLucl encode ubiquitin-firefly 
5 luciferase fusion proteins containing a tyrosine residue immediately after the 
ubiquitin sequence. Plasmid pUbiq(Y)Lucl9 was designed to be expressed in 
mammalian cells and possesses an early promoter of CMV upstream and an 
SV40 polyadenylation signal downstream of the sequence encoding the fusion 
protein. Plasmid pSPUbiqLucl encodes the same protein as pUbiq(Y)Lucl9 but 

10 possesses a promoter recognized by the DNA polymerase of bacteriophage SP6. 
Therefore, plasmid pSPUbiqLucl can be used for in vitro production of mRNA 
encoding the fusion protein. Both of these plasmids were used to confirm that in 
eukaryotic cells, and in mammalian cells specifically, ubiquitin-firefly luciferase 
fusion proteins undergo deubiquitination. 

15 As shown in Figure 1, both in a wheat germ based in vitro translation 

system as well as in transiently transfected mammalian cells, expression of 
recombinant genes encoding ubiquitin-firefly luciferase fusion proteins resulted 
in accumulation of proteins that had molecular masses expected for wild-type 
firefly luciferase. The control wild-type firefly luciferase in these experiments 

20 was encoded either by plasmid pETUbiqLuc or pwtLucl . In addition to the 
major band, a minor band was also present on the autoradiograph of proteins 
generated in the wheat germ based system. This minor band corresponds to the 
full-size recombinant protein that was not deubiquitinated. In several 
experiments performed using CHO cells, the minor band was either much 

25 weaker than in the wheat germ based system or was not detectable at all, 

suggesting that within mammahan cells, deubiquitination of luciferase happens 
very quickly and efficiently. 

Comparison of specific activities of wild type luciferase and its derivatives 

Sung et al. (1995) reported that modifications of the firefly luciferase N- 
30 terminal region could interfere with enzymatic activity. Thus, luciferase species 
generated as a result of deubiquitination process might have different enzymatic 
activities than that of wild-type luciferase. To assess specific activities of 
luciferase fusion proteins and compare them with the specific activity of wild- 
type luciferase, plasmids pT7Ubiq(Y)Lucl9.2 and pT7 Ubiq(E)Lucl9.1 were 
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constructed so that, in addition to an eukaryotic promoter, they have a promoter 
of bacteriophage T7. As a result, these plasmids direct the synthesis of luciferase 
fusion proteins in an in vitro transcription/translation system. Plasmid 
pT7Ubiq(Y)Lucl9.2 encodes exactly the same protein as that encoded by 
5 plasmid pUbiq(Y)Lucl 9. Plasmid pT7 Ubiq(E)Lucl 9. 1 encodes a ubiquitin- 
firefly luciferase fusion protein that differs from the protein encoded by plasmid 
pT7Ubiq(Y)Lucl9.2 in only one position. In this position, located immediately 
after the last amino acid residue of ubiquitin, the protein encoded by plasmid 
pT7Ubiq(E)Lucl9.1 has a glutamic acid residue in place of a tyrosine residue. 
1 0 Additionally, plasmids pETwtLucl and pT7Luc-PEST10 were constructed that 
have a promoter of bacteriophage T7 and encode wild-type firefly luciferase or a 
fusion protein comprising jSrefly luciferase and a mutant form of C-ODC, 
respectively. 

Plasmids encoding wild-type luciferase as well as a luciferase fusion 

1 5 protein were used in a rabbit reticulocyte in vitro transcription/translation system 
to determine luciferase activities accumulated in each reaction mixture and 
normalize these activities by the amount of radioactive leucine incorporated in 
corresponding luciferase species. Data presented in Figure 1 (panel C) 
demonstrate that, similarly to that found in CHO cells, only deubiquitinated 

20 forms of luciferase were accumulated in rabbit reticulocyte in viti^o 
transcription/translation systems supplemented with either plasmid 
pT7Ubiq(Y)Lucl9.2 or pT7Ubiq(E)Lucl9.1. The presence in these mixtures of 
small proteins with electrophoretic mobihties of free ubiquitin evidences that 
full-size ubiquitin-luciferase fusions were produced in these reactions and were 

25 efficiently deubiquitinated. On the contrary, in the case of plasmid pT7Luc- 
PESTIO, accumulation of the full-size protein was observed. Nevertheless, the 
data presented in Figure 2 demonstrate that both deubiquitinated forms of 
luciferase have essentially the same specific activity as that of wild-type 
luciferase and the protein encoded by plasmid pT7Luc-PEST10. 

30 Comparison of ejBBciencies of luciferase destabilization by N-degron and a 
mutant of the C-terminus of m-ODC 

The half-life of the protem encoded by plasmid pUbiq(Y)Lucl9 was 
determined in mammaUan cells and compared to the half-lives of wild-type 
luciferase as well as fusion proteins comprising firefly luciferase and a mutant 
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foim of the C-ODC (Luc-PESTIO). This was done by evaluating the 
luminescence emitted by cells that were transiently transfected with either 
plasmid pUbiq(Y)Lucl9 or pwtLucl or pLuc-PESTlO, respectively, and then, 
for different periods of time, were exposed to the protein synthesis iiohibitor 
5 cycloheximide. hi this experiment, cycloheximide was used at a concentration 
that caused complete inhibition of protein synthesis within 7.5 minutes of 
exposure (100 |ig/ml). Results presented in Figure 3 demonstrate that in COS-7 
cells, the Ubiq(Y)Luc fusion protein has an intermediate half-Ufe compared to 
that of wild-type luciferase and a Luc-PEST fusion protem. The estimated half- 

10 life of Ubiq(Y)Luc in these cells is about 170-200 minutes, which is far longer 
than the half-Uves reported by Varshavsky (1992) for |3-galactosidase containing 
a tyrosine residue at the N-terminus both in E, coli and S. cerevisiae (2 and 10 
minutes, respectively). At the same time, the half-Ufe of wild-type luciferase in 
COS-7 cells (about 7 hours) was shorter than the half-life of the wild-type P- 

15 galactosidase both in E. coli and in S. cerevisiae (more than 10 and 20 hours, 
respectively), suggesting that in COS-7 cells, N-degron initiated degradation 
occurs less efficiently than in yeast and bacteria. The fact that the half-life of 
Luc-PEST in these cells (about 120 minutes) is shorter than the half-Ufe of 
Ubiq(Y)Luc suggests that the protein degrading capacity of the proteosome is 

20 not a limiting parameter for determining the half-Ufe of Ubiq(Y)Luc. When the 
same analysis was performed using either CHO or HeLa cells, it was found that 
degradation of all three proteins in these cells occurs sUghtly faster than in COS- 
7 cells (see Figure 7). 

Analvsis of the N-end rule in different cell Unes 

25 To understand to what degree in mammalian cells the N-end rule might 

differ from the same rule in yeast and E. coli, a set of plasmids was created that 
encode an ubiquitin-firefly luciferase fusion protein, namely pUbiq(Y)Lucl9, 
pT7Ubiq(Y)Lucl9.2 and pT7Ubiq(E)Lucl9.1 (for a Ust of plasmids see Table 
2). Proteins encoded by these plasmids have only one difference in their 

30 sequences; the amino acid residue located inmaediately after the ubiquitin 
sequence. Therefore, upon deubiquitination, these proteins should generate 
fkefly luciferase species that are different only in the first N-terminal amino acid 
residue. Plasmids by themselves have additional differences in the region located 
upstream of the fusion protein coding region. Some of these plasmids have an 
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additional bacteriophage T7 promoter (plasmids designated pT7UbiqCX)Luc). 
Nevertheless, as shown in Figure 3 (see curves for plasmids pT7Ubiq(Y)Lucl9.i 
and pUbiq(Y)Lucl9), the presence or the absence of bacteriophage T7 promoter 
had no effect on the stability of corresponding proteins. 
5 Table 2. 



Plasmid 


Amino acid residue following 
immediately after ubiquitin sequence 


PT7Ubiq(M)Lucl9.2 


Met 


PT7Ubiq(E)Lucl9.1 


Glu 


PT7Ubiq(I)Lucl9.1 


He 


pUbiq(R)Lucl3 


Arg 


pUbiq(F)LuclO 


Phe 


pXJbiq(A)Luc2 


Ala 


pXJbiq(N)Luc25 


Asn 


pUbiq(W)Lucl6 


Trp 


pUbiq(L)Luc23 


Leu 


pUbiq(Q)Luc36 


Gin 


pUbiq(Y)Lucl9 


Tyr 


pUbiq(Asp3)Lucl6 


Asp 


pUbiq(His2)Luc3 


His 1 


pUbiq(K)Luc4 


Lys 



The stabihties of proteins encoded by plasmids hsted in Table 2 were 
analyzed using cell lines derived from three different species, hamster (CHO), 
monkey (COS-7) and human (HeLa). It was observed that with one exception, 

1 0 Ubiq(M)Luc in HeLa cells, all ubiquitin containing firefly luciferase fusion 
proteins were degraded faster than wild-type luciferase (Figure 4). Patterns of 
relationships between amino acid residues positioned at the N-terminus of 
luciferase and the half-Ufe of the corresponding protein remained essentially 
unchanged from experiment to experiment. For all three cell Unes, a glutamic 

1 5 acid residue was found to be the most efficient destabihzing residue among all 
residues tested. Aspartic acid was also found to be a quite efficient destabihzing 
residue. At the same time, in all three cell Unes, basic amino acid residues were 
found to have relatively weak destabihzing properties. In CHO cells, the 
difference between the half-Uves of the most stable and the least stable 

20 constructs was almost two times smaller than the same differences determined in 
COS-7 and HeLa cells. Thus, in different cell Unes, the N-end rule might have a 
different role m determining the fate of the protein. 
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In general, a correlation was observed between the half-life of the 
specific protein and the luminescence of cells producing this protein. Indeed, in 
Figure 4, nfxost of the symbols characterizing the relationship between 
luminescence intensities and half-lives of firefly luciferase fiision protein tend to 
5 locate along straight lines. Nevertheless, in some cases such a correlation was 
not observed. For example, in COS-7 cells, the protein encoded by plasmid 
pUbiq(A)Luc2 produced much less luminescence than would be expected based 
on its half-Ufe in these cells. Moreover, when the same protein was produced in 
CHO or HeLa cells, a dramatic deviation fi-om the general rule was not observed. 

10 The potential role of inconsistencies in transfection efficiency in generating this 
phenomena was addressed by introducing, in addition to the plasmid encoding 
the firefly luciferase fusion protein, the same amount of plasmid encoding 
Renilla luciferase to each transfection mixture. Renilla luminescence in the 
transfected cells was used as a measure of transfection efficiency and to 

1 5 normalize firefly luminescence produced by the same cells. Normalization did 
not eliminate the observed inconsistencies, suggesting that such inconsistencies 
might reflect biological characteristics of the specific protein-cell pairs. 
Dependence of the N-end rule on the protein structure 

To understand dominance of the N-terminal residue in determining the 

20 fate of the protein within mammalian cells, the stabiUties of luciferase fiision 
proteins having the same amino acid residue at the N-terminus, but different 
peptide sequences attached to the firefly luciferase, were compared. As shown 
in Figure 5, proteins encoded by plasmids pUbiq(E)ALuc6 and pUbiq(H)ALucl8 
have four amino acid deletions when compared to proteins encoded by plasmids 

25 pT7Ubiq(E)Lucl9. 1 and pUbiq(His2)Luc3, respectively. In the protein encoded 
by plasmid pUbiqLuclS, one glutamic acid residue substitutes for residues found 
in the protein encoded by pUbiq(Y)Lucl9. 

Data presented in Figure 6 reveal that proteins with the same N-terminal 
residue could have dramatically different half-lives depending on the structure of 

30 the following sequence. For example, in COS-7 cells, the half-life of the protein 
encoded by plasmid pUbiq(Y)Lucl9 was about 140 minutes while the half-fife 
of the protein encoded by plasndd pUbiqLuclS was almost 7 hours. Depending 
on the cells used for the experiment, this difference could be less dramatic. 
Indeed, the same proteins when analyzed in CHO cells had half-Uves of about 



56 



1 



wo 2004/025264 PCT/US2003/028939 

190 minutes and about 240 minutes, respectively. Surprisingly, depending on the 
amino acid residue located at the N-terminus, the effect of changes in the 
following sequence could be different. Indeed, deletion of four amino acid 
residues in a protein that has a histidine residue at the N-terminus resulted in 
5 destabilization of the protein (in COS-7 cells the half-life was reduced from 
about 300 minutes to about 120 minutes and in CHO cells, from about 320 
minutes to about 200 minutes). At the same time, when the N-terminal position 
was occupied by an aspartic acid residue, the same deletion resulted in 
stabihzation of the protein in COS-7 cells (the half-Ufe changed from about 60 
10 minutes to about 200 minutes) and had essentially no effect on the stabiUty of 
the corresponding protein in CHO cells. Thus, even though the N-terminal 
residue plays an important role in the determining protein stability, it is not the 
dominant factor. 

Optimization of the reporter properties of firefly luciferase 

15 It had been reported that degradation of ODC occurs in the 26S 

proteosome without prior ubiquitination (Bercovich et al., 1989; Murakami et 
al, 1992), while degradation directed by N-degron was reported to be a 
ubiquitin-dependent process. To determine whether the combination of N-degron 
and C-ODC in the same protein might direct the recombinant protein towards 

20 degradation in the proteosome by two different pathways, thus decreasing the 
half-Ufe of the protein even ftirther, three plasmids, pUbiq(Y)Luc-PEST5, 
pUbiq(R)Luc-PEST12 andpT7Ubiq(E)Luc-PEST23, were constructed. 
Sequences of proteins encoded by these plasmids are different only in the 
position that follows immediately after the last amino acid residue of the 

25 ubiquitin sequence. In that position, the protein encoded by plasmid 
pUbiq(Y)Luc-PEST5 has a tyrosine, the protein encoded by plasmid 
pUbiq(R)Luc-PEST12 has an arginine, and the protein encoded by plasmid 
pT7Ubiq(E)Luc-PEST23 has a glutamic acid residue. The stabilities of proteins 
encoded by these plasmids were tested in HeLa (Figure 7), COS-7 and CHO 

30 cells. The analysis revealed that m all cell lines, proteins with two degradation 
signals were less stable than proteins that contained only one signal. For 
example, in HeLa cells, ubiquitin-luciferase ftision proteins Ubiq(E)Luc or Luc- 
PESTIO had half-Uves of 70 and 65 minutes, respectively (Figure 4 and Figure 
7). At the same time, the half-hves of proteins encoded by plasmids 
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pUbiq(Y)Luc-PEST5, pUbiq(R)Luc-PEST12 and pT7Ubiq(E)Luc-PEST23 were 
55, 40 and 30 minutes, respectively. Additionally, a combination of two 
degradation signals in the same protein resulted in more consistent degradation 
from cell line to cell line. For example, when three different cell Unes were used 
5 to determine the half-life of the protein Ubiq(E)Luc that contains only the N- 
degron, the value varied from 70 to 150 minutes (Figure 4). When the same cell 
lines were used to determine the half-life of the protein encoded by plasmid 
pT7Ubiq(E)Luc-PEST23, the half-hfe was 30 minutes for HeLa cells (Figure 7), 
45 minutes for COS-7 cells and 40 minutes for CHO cells. Luminescence data 
10 from cells transfected with plasmids encoding luciferase with another 

combination of protein destabilization sequences are shown in Figure 8. The 
presence of CL-1 and PEST in a luciferase fiision protein resulted in a protein 
that had a reduced half-hfe relative to a luciferase ftision protein with either CL- 
1 or a PEST sequence. 
15 To determine whether optimized codon sequences can enhance the 

amount of hght emitted in cells transfected with a plasmid encoding a luciferase 
fusion protein, plasmid pT7Ubiq(E)hLuc+PEST80 was constructed, which 
encodes the same protein as that encoded by plasmid pT7Ubiq(E)Luc-PEST23, 
except that it contains a luciferase encoding sequence that has been optimized 
20 for expression in human cells. As a result, translation of the fusion protein 
proceeds more efficiently when this protein is encoded by plasmid 
pT7Ubiq(E)hLuc+PEST80 rather than by plasmid pT7Ubiq(E)Luc-PEST23. As 
shown in Figure 7, the luminescence of cells transfected with plasmid 
pT7Ubiq(E)hLuc+PEST80 becomes comparable to that of cells producing wild- 
type firefly luciferase. Therefore, codon optimized sequences compensate for 
loss of expression when incorporated with a destabilization sequence as 
compared to a destablilization sequence in conjunction with a non-optimized 
codon sequence. 

To determine whether a mRNA destabilization sequence in the mRNA 
for a fusion polypeptide comprising a luciferase and a protein destabihzation 
sequence could fiirther decrease the half-life of expression of a luciferase 
encoded by an optunized sequence, plasmids with promoters hnked to optimized 
Renilla luciferase sequences and various combinations of destabilization 
sequences were tested (Figure 9). Generally, the greater the number of 
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destabilization sequences, the shorter the half-Ufe of expression of the encoded 
protein. 

Figures 10 and 12-16 show luminescence after the induction of 
expression of optimized Renilla, firefly or cUck beetle luciferase sequences firom 
5 plasmids having various combinations of destabiUzation sequences. Plasmids 
with more destabilization sequences generally had better response profiles than 
those with no or fewer destabilization sequences, i.e., destabilized reporters 
respond faster and their relative activation is higher than that of more stable 
derivatives. 

10 Figure 13 demonstrates that reporters can respond to two subsequent 

stimuh and that destabiUzed reporters are more suitable than stable reporters for 
detection of subsequent stimuli (when two stimuli occur in a relative short period 
of time) because a stable reporter does not have time to react. In the bottom 
graph of Figure 13, the curve corresponding to the stable version of optimized 

1 5 firefly luciferase continues to increase. At the same time, the curve 

corresponding to the destabilized protein, after reaching a maximum, begins to 
decrease, and only after the addition of hCG begins to increase again. 

To determine whether destabilized reporter proteins in stably transfected 
cell lines could be detected, D293 cells were transfected with plasmids 

20 containing luciferase encoding sequences xmder the control of a cAMP regulated 
promoter. Plasmid pCRE-hLuc+Kanl8 encodes a stable version of firefly 
luciferase, plasmid pCRE-hLucP+Kan8 encodes a luciferase fiision that has a 
PEST sequence at the C-terminus, and plasmid pCRE-hLucCP+Kan28 encodes a 
firefly luciferase fiision polypeptide that has CLl-PEST sequences as well as 

25 mRNA that has a mRNA destabilization sequence (UTR). G41 8-resistant clones 
were treated for 7 hours with 10 fiM of forskolin or incubated for the same 
period of time in forskolin-firee media. After the completion of the incubation 
period, luminescence was detennined using Bright-Glo reagent (Figure 14). 
Stable clones with destabilized constructs were generally as bright as stable 

30 clones with a nondestabilized construct. 
Discussion 

Varshavsky and coauthors determined that both in yeast and E, coli cells 
there is a definite correlation between the identity of the N-terminal residue and 
the half-life of the corresponding protein (the N-end rule). These findings 
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suggested that the residue located at the N-terminus of the protein might play an 
important role in determining the fate of the protein inside bacterial and yeast 
cells. The data herein demonstrate that in diflferent mammalian cells, depending 
on the identity of the N-terminal residue, the half-life of firefly luciferase fusion 
5 proteins might vary from 420 to 70 minutes, suggesting that the N-end rule 
functions in mammalian cells. Nevertheless, the N-end rule in mammalian cells 
is surprisingly different from the N-end rule described earlier for yeast and E. 
colt Indeed, arginine and lysine residues were identified as the most 
destabilizing residues both in yeast and bacteria (Varshavsky, 1992). In contrast, 

1 0 these residues were just moderately destabilizing in manmialian cells. In all cell 
lines tested, a glutamic acid residue had the most dramatic effect on the stability 
of firefly luciferase. At the same time^ according to Varshavsky (1992), this 
residue in yeast had a moderate destabilizing effect and in E, coli glutamic acid 
had no destabilizing effect at all. 

15 The data demonstrate that, in addition to the nature of the host cells, the 

structure of the protein may have an effect on the N-end rule. Indeed, by 
introducing a small deletion in the area that, after deubiquitination, becomes the 
N-terminus, the relative destabilizing properties of histidine and glutamic acid 
residues could be changed. As a result, in COS-7 cells, luciferase fusion proteins 

20 with a N-terminal histidine residue became even more unstable than the 
corresponding protein with a glutamic acid residue at the N-terminus. It is 
important to mention that this effect was cell-specific. In CHO cells, the same 
deletion reduced the half-life of the protein possessing a histidine residue at the 
N-terminus but did not change the half-life of the protein possessing a glutamic 

25 acid residue at the N-terminus (see Figure 6) and, as a result, did not change the 
status of glutamic acid as a more efficient destabilizing residue than histidine. 

Moreover, the addition of a mODC fragment to the C-terminus of fu-efly 
luciferase changed the half-life of corresponding proteins but at the same time 
did not alter the destabilizing effect of a glutamic acid residue relative to 

30 argmine and tyrosine. Further, glutamic acid appears to be the most efficient 
destabilizing residue in the case of Renilla luciferase (data not presented). 
Therefore, the N-end rule determined for one protein may provide a useful 
guidance to destabilization of other proteins. 
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In maimnalian cells, the N-degron dependent degradation pathway may 
function less efficiently than it does in yeast cells. Indeed, by positioning Arg, 
Lys, Phe, Leu, Trp, His, Asp or Asn at the N-tenninus of p-galactosidase, 
Varshavsky and coauthors (Bachmair et al., 1986) were able to reduce the half- 
5 Ufe of P-galactosidase in yeast from 20 hours to 2-3 minutes. At the same time, 
in a mammalian cell, even with glutamic acid at the N-terminus, firefly 
luciferase had a half-life of greater than 45-50 minutes. When compared to the 
protein degradation signal contained within the C-ODC, N-degron alone does 
not provide a superior approach to the destabiUzation of proteins. Nevertheless, 

10 the data demonstrate that N-degron and C-ODC can complement each other and 
the combination of these two degradation signals on the same protein results in 
an increased rate of protein degradation. Using this combination of degradation 
signals, a firefly luciferase was generated that in mammahan cells has the 
shortest half-life among currently described reporter proteins. In contrast, a 

15 protein having a degradation signal from listeriolysin 0 and from murine C-ODC 
had a rate of protein degradation which was similar to a protein having a 
degradation signal from murine C-ODC (data not shown). 

The rapid turnover of reporter proteins in the cell may be invaluable for 
monitoring fast processes in the cell. Additionally, destabilized reporters can 

20 allow for a substantial reduction of time in high- throughput screening 

experiments. One major disadvantage of destabihzed reporter proteins is related 
to the fact that, because of reduced quantities of such proteins in the cell, the 
signal available for detection and analysis is weaker than the signal generated by 
wild-type reporter proteins. For example, cells producing firefly luciferase fusion 

25 proteins that possess both N-degron and C-ODC emit almost ten times less Ught 
than the same cells producing wild-type luciferase (see Figure 7). Nevertheless, 
optimization of the sequence encoding reporter protein provides a useful 
approach to overcome this limitation. Indeed, by using this approach, the signal 
emitted by cells producing destabihzed firefly luciferase was increased almost 

3 0 eight-fold without affecting the half-life of the reporter. 
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described in relation to certain preferred embodiments thereof, and many details 
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WHAT IS CLAIMED IS: 

1 . An isolated nucleic acid molecule comprising a nucleic acid sequence 
encoding a fusion polypeptide comprising a reporter protein and at least two 

5 different heterologous protein destabilization sequences, which fusion 

polypeptide has a reduced half-life relative to a corresponding reporter protein 
which lacks the heterologous protein destabilization sequences or has a reduced 
half-Kfe relative to a corresponding reporter protein which has one of the 
heterologous protein destabilization sequences, wherein ubiquitin is not one of 
10 the two different heterologous protein destabilization sequences. 

2. An isolated nucleic acid molecule comprising a nucleic acid sequence 
comprising an open reading frame for a reporter protem and at least two 
heterologous destabilization sequences, wherem one of the heterologous 

1 5 destabilization sequences is a mRNA destabilization sequence and another is a 
heterologous protein destabilization sequence which is not ubiquitin. 

3. An isolated nucleic acid molecule comprising a nucleic acid sequence 
comprising an open reading frame for a luciferase and at least one heterologous 

20 destabilization sequence, wherein a majority of codons in the open reading frame 
for the luciferase are codons which are preferentially employed in a selected host 
cell 

4. The isolated nucleic acid molecule of claim 1, 2 or 3 further comprising a 
25 promoter operably linked to the nucleic acid sequence. 

5. The isolated nucleic acid molecule of claim 4 wherein the promoter is a 
regulatable promoter. 

30 6. The isolated nucleic acid molecule of claim 5 wherein the promoter is an 
inducible promoter. 

7. The isolated nucleic acid molecule of claim 5 wherein the promoter is a 
repressible promoter. 
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8. The isolated nucleic acid molecule of claim 1 further comprising a 
heterologous mRNA destabilization sequence. 

5 9. The isolated nucleic acid molecule of claim 2 or 8 wherein the mRNA 
destabilization sequence is 3* to the nucleic acid sequence. 

10. The isolated nucleic acid molecule of claim 1 or 2 wherein the nucleic 
acid sequence encoding at least the reporter protein is optimized for expression 

10 in a host cell. 

11. The isolated nucleic acid molecule of claim 1 or 2 wherein the reporter 
protein encodes a luciferase. 

15 12. The isolated nucleic acid molecule of claim 1 wherein the reporter 
protein encodes a beetle luciferase. 

13. The isolated nucleic acid molecule of claim 12 wherein the reporter 
protein encodes a click beetle luciferase. 

20 

14. The isolated nucleic acid molecule of claim 1 wherein the reporter 
protein encodes an anthozoan luciferase protein. 

15. The isolated nucleic acid molecule of claim 3 wherein the heterologous 
25 destabilization sequence is a protein destabilization sequence. 

16. The isolated nucleic acid molecule of claim 3 wherein the heterologous 
destabilization sequence is a mRNA destabilization sequence. 

30 17. The isolated nucleic acid molecule of claim 1, 2 or 3 wherein nucleic 
acid sequence comprises SEQ ID NO:47, SEQ ID NO:48, SEQ ED NO:49, SEQ 
ID NO:66, SEQ ID NO:69, SEQ ID NO:70, SEQ ID N0:71, SEQ ID NO:72, 
SEQ ID NO:73, SEQ ID NO:74, SEQ ID NO:753 SEQ ID NO:76, SEQ ID 
NO:77, SEQ ID NO:78, SEQ ID NO:79, SEQ ID NO:80, or a fragment thereof 
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that encodes a fusion polypeptide with substantially the same activity as the 
corresponding full-length fusion polypeptide encoded by SEQ ID NO:47, SEQ 
ID NO:48, SEQ ID NO:66, SEQ ID NO:69, SEQ ED NO:70, SEQ ID N0:71, 
SEQ ID NO:72, SEQ IDNO:73, SEQ ID NO:74, SEQ ID NO:75, SEQ ID 
5 NO:76, SEQ ID NO:77, SEQ ID NO:78, SEQ ID NO:79 or SEQ ID NO:80. 

1 8. The isolated nucleic acid molecule of claim 1 further comprising a 
mRNA destabilization sequence. 

10 19. The isolated molecule of claim 1 8 wherein one protein destabilization 
sequence is a PEST sequence. 

20. The isolated nucleic acid molecule of claim 1 or 2 wherein one 
heterologous protein destabilization sequence is a PEST sequence. 

15 

2 1 . The isolated nucleic acid molecule of claim 1 or 2 wherein one 
heterologous protein destabiUzation sequence is from the C-tenninus of a 
mammalian ornithine decarboxylase. 

20 22. The isolated nucleic acid molecule of claim 1 or 2 wherein one 
heterologous protein destabilization sequence is a mutant ornithine 
decarboxylase sequence. 

23 . The isolated nucleic acid molecule of claim 2 1 wherein the mutant 
25 ornithine decarboxylase sequence has an amino acid substitution at a position 

corresponding to position 426, 427, 428, 430, 431, 433, 434, 439 or 448 of 
murine ornithine decarboxylase. 

24. The isolated nucleic acid molecule of claim 1 or 2 wherein one 

30 heterologous protein destabilization sequence is CLl, CL2, CL6, CL9, CLIO, 
CLl 1, CL12, CL15, CL16, CL17 or SL17. 
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25. The isolated nucleic acid molecule of claim 1 or 2 wherein one 
heterologous protein destabilization sequence is at the C-terminus of the reporter 
protein. 

5 26. The isolated nucleic acid molecule of claim 1 or 2 wherein one 

heterologous protein destabilization sequence at the N^terminus of the reporter 
protein. 

27. The isolated nucleic acid molecule of claim 2 further comprising an 
1 0 ubiquitin polypeptide at the N-terminus of the fusion polypeptide. 

28. The isolated nucleic acid molecule of claim 27 wherein one of the 
heterologous protein destabilization sequences is at the C-terminus of ubiquitin. 

1 5 29. The isolated nucleic acid molecule of claim 28 wherein one of the 
heterologous protein destabilization sequences comprises a glutamic acid or 
arginine residue. 

30. The isolated nucleic acid molecule of claim 10 which encodes a fusion 
20 polypeptide with a half-Ufe of expression of about 20 minutes. 

3 1 . The isolated nucleic acid molecule of claim 10 which encodes a fusion 
polypeptide with a half-life of expression of about 30 minutes. 

25 32. The isolated nucleic acid molecule of claim 1 5 wherein the heterologous 
protein destabilization sequence is a PEST sequence. 

33. The isolated nucleic acid molecule of claim 15 wherein the heterologous 
protein destabilization sequence is from the C-tenninus of a mammahan 

30 ornithine decarboxylase. 

34. The isolated nucleic acid molecule of claim 1 5 wherein the heterologous 
protein destabilization sequence is CLl, CL2, CL6, CL9, CLIO, CLll, CL12, 
CL15, CL16, CL17orSL17. 
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35. A vector comprising the nucleic acid molecule of claim 1 , 2 or 3. 

36. The vector of claim 35 wherein the nucleic acid molecule is operably 
5 linked to a regulatable promoter, 

37. The vector of claim 35 wherein the promoter is a repressible promoter. 

38. The vector of claim 35 wherein the nucleic acid molecule comprises SEQ 
10 ID NO:49, SEQ ID NO:75, SEQ ID NO:76, SEQ ID NO:77, SEQ ID NO:78, 

SEQ ID NO:79, SEQ ID NO:80 or a fragment thereof that encodes a fusion 
polypeptide with substantially the same activity as the corresponding full-length 
fusion polypeptide encoded by SEQ ID NO:49, SEQ ID NO:75, SEQ ID NO:76, 
SEQ ID NO:77, SEQ ID NO:78, SEQ ID NO:79 or SEQ ID NO:80. 

15 

39. A fusion polypeptide encoded by the nucleic acid molecule of claim 1 , 2 
or 3. 

40. The fusion polypeptide of claim 39 wherein the reporter protein is 
20 chloramphenicol acetyltransferase, luciferase, beta-glucuronidase or beta- 

galactosidase. 

41 . A host cell comprising the vector of claim 35. 

25 42. The host cell of claim 4 1 which is stably transfected with the vector that 
encodes a fusion polypeptide comprising a luminescent protein. 

43. The host cell of claim 42 wherein the signal emitted by the host cell 
comprising the vector is greater than the signal emitted by a corresponding host 

30 cell comprising a vector which lacks one or more of the destabilization 
sequences. 

44. A stable cell line comprising the vector of claim 35 wherein the signal 
emitted by the reporter protein is equal to or greater than a signal emitted by a 
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corresponding stable cell line comprising a vector which lacks one or more of 
the heterologous destabilization sequences. 

45. A method to detect a reporter protein in a cell, comprising: 
5 a) contacting a cell with the vector of claim 35; and 

b) detecting or determining the presence or amount of the reporter protein 
in the cell or a lysate thereof 
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Figure 1 
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Figure 2 
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Figure 3 
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Figure 4 
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Figure 5 
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Figure 6 
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Figure 7 
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Figure 9 
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Figure 10 A 
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Figure 10 B 
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Figure 1 \ 
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Figure 1^ 
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